THE STRUCTURE OF INTELLECT
J. P. GUILFORD 
University of Southern California 


It is the purpose of this report to 
describe a developing picture of the 
structure of human, adult intellect, 
as seen in terms of factors. Although 
the picture is incomplete, presenting 
it at this time seems desirable for two 
reasons. The picture now includes 
about forty different factors, most of 
which are generally unfamiliar. Many 
have only recently been demon- 
strated. Enough of the intellectual 
factors are known to suggest strongly 
the outlines of a system. The system 
has interesting theoretical implications,
and, by reason of certain vacancies that appear,
it points to still undiscovered factors, somewhat as the
chemist’s periodic table has served 
to indicate unknown elements. 

As the writer has emphasized be- 
fore (10, 13), psychology and psychol- 
ogists since Binet have taken a much 
too restricted view of human intelli- 
gence. We do not need to go into the 
reasons here. They can be summed 
up in a positive manner by saying 
that in attempting to fathom the na-
ture of intellect more attention
should be given to the human adult,
particularly the superior human adult.
It is to such specimens that we must
go, if we are to investigate intellectual
qualities and functions in their great-
est scope and variety.

The advent of multiple-factor an-
alysis has done something to broaden
and enrich our conception of human
intelligence, but factor theory and
the results of factor analysis have 
had little effect upon the practices of 
measurement of intelligence. We do 
have a great variety of tests in such 
intelligence scales as the Binet and 
its revisions and in the Wechsler 
scales, to be sure. Too commonly, 
however, a single score is the only 
information utilized, and this single 
score is usually dominated by vari- 
ance in only one or two factors. 
There is some indication of more gen- 
eral use of part scores, as in connec- 
tion with the Wechsler tests, but each 
of these scores is usually factorially 
complex and its psychological mean- 
ing is largely unknown as well as am- 
biguous. The list of factors that is 
to be presented in this article should 
clearly demonstrate the very limited 
information that a single score can 
give concerning an individual, and 
on the other hand, the rich possi- 
bilities that those factors offer for 
more complete and more meaningful 
assessments of the intellects of per- 
sons. 

Some seven years ago the writer initiated research aimed
essentially at the study of adult, human intelligence, in a
project on “aptitudes of high-level personnel.”¹ In some
respects this has been a continuation of wartime research in
the AAF Aviation Psychology Research Program (21). The project
was initiated with the conviction that the full scope of human
intellect had not yet been explored, by factor-analysis methods
or by any other methods. Thinking abilities, which have played
important roles in some definitions of intelligence, seemed to
have been neglected; particularly abilities having to do with
productive thinking. Accordingly, four areas of thinking were
selected for study, arbitrarily designated as reasoning,
creativity, planning, and evaluation. While abilities belong to
the context of individual differences, they also imply
psychological functions of individuals. Hence it was thought
that the findings would have much to offer toward an
understanding of human thinking of various kinds, including
problem solving.

¹ Project 150-044, under Contract N6onr-23810, with the Office
of Naval Research, monitored by the Personnel and Training
Branch. Among those who have made the most significant
contributions to the project are: Raymond M. Berger, Paul R.
Christensen, Andrew L. Comrey, Russel F. Green, Alfred F.
Hertzka, Norman W. Kettner, and Robert C. Wilson. I am
particularly indebted to Christensen and Kettner for reading
the preliminary draft of this paper, and to Philip R.
Merrifield, also, for making suggestions.

Space does not permit describing in detail the research
procedures; they have been described in the various technical
reports from the aptitudes project (14, 15). It should be
pointed out, however, that the factor analyses were done in a
research design that includes experimental features. Each
investigation starts by hypothesizing that certain unitary
abilities (psychological factors) exist and that they have
certain properties. Psychological tests are then selected,
adapted, and constructed for each hypothesized factor in a way
that should lead to a “yes” or “no” answer from the analysis.
The results should show that the factor hypothesized does or
does not exist and it does or does not have the properties
suggested. Thus, the kind of
psychological test is an important 
independent variable, more or less 
under the control of the investigator. 
Certain other experimental variables 
are held relatively constant—the 
testing conditions and certain population features, such as
sex, age, education, and motivation. The examinees have been
men who were previously selected for military training leading
to an officer's commission and they have been tested under
ordinary military discipline.

In his survey of aptitude factors, published in 1951, French
(8) listed, among others, 18 or 19 factors that can be
classified as intellectual. Our investigations of thinking
abilities have verified and helped to clarify many of these
factors, besides introducing approximately as many new ones.
Other recent investigations have also contributed new
information regarding factors. The list presented here comes
from all these sources.



CLASSES OF INTELLECTUAL FACTORS 


Inspection of the total list shows that the intellectual
factors fall into two major groups: thinking and memory
factors. The great majority of them can be regarded as thinking
factors. Within this group, a threefold division appears:
cognition (discovery) factors, production factors, and
evaluation factors. The production group can be significantly
subdivided into a class of convergent-thinking abilities and a
class of divergent-thinking abilities.²

² In the system of the intellectual factors to be described
here, the reader will find some striking similarities to a
system developed independently by Burt (2). The similarities
are support for the idea that a system does exist.


Cognition (Discovery) Factors

The cognition factors have to do with becoming aware of mental
items or constructs of one kind or another. In the tests of
these factors, something must be comprehended, recognized, or
discovered by the examinee. They represent functions on the
receiving side of behavior sequences.

The cognition abilities can be differentiated along the lines
of two major principles. For some time we have been aware that
thinking factors tend to pair off according to the material or
content used in the tests. For each factor of a certain kind
found in verbal tests there seemed to be a mate found in tests
composed of figures or designs. We found, for example, a factor
called eduction of perceptual relations, parallel with a factor
called eduction of conceptual relations; a factor called
perceptual foresight, parallel to one called conceptual
foresight; and a factor of perceptual classification, parallel
with one of conceptual classification. Only recently there has
been increasing evidence for a third content category. Factors
were found in tests whose contents are letters, or equivalent
symbols, where neither perceived form or figure nor verbal
meaning is the basis of operation. Factors based upon this type
of material have been found, parallel to other factors where
the test content is figural or verbal. Thus a third content
category seems necessary.

A second major principle by which cognition factors may be
differentiated psychologically depends upon the kind of thing
discovered; whether it is a relation, a class, or a pattern,
and so on. Thus, for each combination of content and thing
discovered, we have a potential factor. The cognition factors
can therefore be arranged in a matrix as shown in Table 1. The
third and fourth rows seem to be complete at the present time.
There are vacancies in the other four rows. With each factor
name are usually given two representative tests by name to help
give the factor operational meaning.³ A word or two will be
said in addition regarding the less familiar tests.⁴
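
The matrix arrangement just described can be sketched in a few lines of Python. This is offered only as an illustration of the bookkeeping, not as part of the research procedure. The names `COGNITION_FACTORS`, `complete_rows`, and `vacancies` are assumptions introduced here; the cell contents follow Table 1, and an empty list stands for a vacant cell.

```python
# Table 1 as a nested mapping:
# kind of thing discovered -> type of content -> factor names.
# An empty list marks a vacancy (no factor demonstrated so far).
# All identifiers below are illustrative, not from the report.
COGNITION_FACTORS = {
    "Fundaments": {
        "Figural": ["Figural closure"],
        "Structural": [],
        "Conceptual": ["Verbal comprehension"],
    },
    "Classes": {
        "Figural": ["Perceptual classification"],
        "Structural": [],
        "Conceptual": ["Verbal classification"],
    },
    "Relations": {
        "Figural": ["Eduction of perceptual relations"],
        "Structural": ["Eduction of structural relations"],
        "Conceptual": ["Eduction of conceptual relations"],
    },
    "Patterns or systems": {
        "Figural": ["Spatial orientation"],
        "Structural": ["Eduction of patterns"],
        "Conceptual": ["General reasoning"],
    },
    "Problems": {
        "Figural": [],
        "Structural": [],
        "Conceptual": ["Sensitivity to problems"],
    },
    "Implications": {
        "Figural": ["Perceptual foresight"],
        "Structural": [],
        "Conceptual": ["Conceptual foresight", "Penetration"],
    },
}


def complete_rows(matrix):
    """Rows in which every content column holds at least one factor."""
    return [row for row, cells in matrix.items() if all(cells.values())]


def vacancies(matrix):
    """(row, column) pairs for which no factor is listed."""
    return [
        (row, col)
        for row, cells in matrix.items()
        for col, names in cells.items()
        if not names
    ]


if __name__ == "__main__":
    print("complete rows:", complete_rows(COGNITION_FACTORS))
    for row, col in vacancies(COGNITION_FACTORS):
        print(f"vacant cell: {row} / {col}")
```

Listing the vacant cells in this way is simply a restatement of the point made in the text: each empty cell is a hypothesis about a factor that may yet be found.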

It should not be surprising to find the factor of verbal
comprehension, the best known, and the dominant one in
verbal-intelligence tests generally, in the first row of the
cognition factors and in the conceptual column. The fact that
the cognition factors sometimes come in threes leads us to look
for parallel factors for the perceptual and structural columns.
One candidate for the perceptual cell in this row would be the
well-known factor of perceptual speed. This factor has to do
with discriminations of small differences in form rather than
in awareness of total figures, hence it does not quite fill the
requirement of parallel properties with verbal comprehension. A
better factor for this purpose is the one Thurstone (28) called
“speed and strength of closure,” called figural closure in
Table 1. For this factor, awareness of perceived objects from
limited cues is the key property. The limitation of cues is
necessary to make the test sufficiently difficult for testing
purposes.

There is no known factor that seems to belong in the second
column of the first row of Table 1. In generalizing the class
of three such factors, and in differentiation from other
classes in Table 1, it is clear that those in the first row
have to do with awareness of items, elements, or things. To
denote this category Spearman's term “fundament” has been
adopted.

³ It should not be inferred that these are the only kinds of
tests related to the factor.
⁴ For more complete descriptions of the tests see particularly
(14, 17, 21).


TABLE 1

COGNITION (DISCOVERY) FACTORS

Type of thing known   Type of content
or discovered         Figural                           Structural                        Conceptual

Fundaments            Figural closure                   --                                Verbal comprehension
                        Street Gestalt Completion                                           Vocabulary
                        Mutilated Words

Classes               Perceptual classification         --                                Verbal classification
                        Figure Classification                                               Word Classification
                        Picture Classification                                              Verbal Classification

Relations             Eduction of perceptual relations  Eduction of structural relations  Eduction of conceptual relations
                        Figure Analogies                  Seeing Trends II                  Verbal Analogies
                        Figure Matrix                     Correlate Completion II           Word Matrix

Patterns or systems   Spatial orientation               Eduction of patterns              General reasoning
                        Spatial Orientation               Circle Reasoning                  Arithmetic Reasoning
                        Flags, Figures, Cards             Letter Triangle                   Ship Destination

Problems              --                                --                                Sensitivity to problems
                                                                                            Seeing Problems
                                                                                            Seeing Deficiencies

Implications          Perceptual foresight              --                                Conceptual foresight
                        Competitive Planning                                                Pertinent Questions
                        Route Planning                                                      Alternate Methods
                                                                                          Penetration
                                                                                            Social Institutions
                                                                                            Similarities


Two factors involving ability to recognize classes are known,
one in which the class is formed on the basis of figural
properties and the other on the basis of meanings. It was
interesting that the Picture Classification test had more
relation to the perceptual-classification factor than to the
conceptual-classification factor in spite of the fact that the
things to be classified were common objects, the basis for
whose classification was intended to be their meanings. This
might mean that the perceptual-conceptual distinction is a
somewhat superficial matter, pertaining only to how the
material is presented. It is possible, however, that in many of
the items in this test the general shapes and sizes and other
figural properties are an aid in classification. For example,
there are cleaning implements, containers, etc., in some items,
where similarities of appearance may serve as clues.

The difference between the Word Classification test and the
Verbal Classification test is largely in the form of
presentation of the problems. A sample item from the Word
Classification test is: “A. horse B. cow C. man D. flower.”
Which word does not belong? In the Verbal Classification test,
two short lists of words are given to establish two classes,
e.g., animals and pieces of furniture. A longer list of words
is given, each one of which must be marked as belonging to one
class or the other or to neither class.

Is there likely to be a factor having to do with the seeing of
classes when class membership depends upon structural
properties? Such a factor would be reasonable. We have much to
learn regarding the scope of structural ideas. Thus far,
structural factors have been found only in tests utilizing
letters and very simple forms such as circles, dashes, and the
like. One can raise the question whether mechanical
conceptions, for example, belong in this class. There is also
the question of where figural properties end and structural
properties begin, also of where structural properties end and
conceptual properties begin. We may actually have a continuum
here. With respect to some categories (including classes,
fundaments, etc.) there may be a rapid transition from figural
to conceptual, thus leaving no basis for a third factor. It is
likely that the factors in any row of Table 1 are positively
and sometimes even substantially correlated. The general
question of correlations among factors will be left for later
discussion.

We have a complete triad of factors having to do with the
seeing of relationships and tests to measure them that are
similar except for content. The analogies tests are well known.
A matrix test is essentially a two-dimensional analogies test,
examples of which may be found in the Raven Progressive
Matrices series. In the test Seeing Trends II, we find the
following type of item: “anger bacteria camel dead excite.” The
examinee is to name the letter trend, which, in this item, of
course, is that the initial letters are in alphabetical order
from “a” to “e.” In the Correlate Completion II test, an
illustrative item reads: “am ma not ton tool ___”: what word
should come next? Here it is not word meaning that is important
but letter sequences. In the Seeing Trends II test, likewise,
the word meanings are of no significance. Presumably, an
analogies test utilizing letters only would do as well as a
measure of this factor.

In the row of Table 1 pertaining to patterns or systems, we
have three factors, but they are much more disparate in kind
than usual in this table. The clearest example of an
eduction-of-patterns factor is in the middle column. The Circle
Reasoning test, adapted from Blakey (9), is similar to the
Marks test of Thurstone and to the Spatial Reasoning test of
the AAF (21). In a sequence of symbols the examinee must
discover the principle by which certain symbols are marked,
then he must mark a new set accordingly. In the Letter Triangle
test, the letters are arranged in a different alphabetical
pattern in each item. The examinee must discover the pattern
and show this by filling a blank with a letter.

Under the figural category we find the factor of spatial
orientation, a well-known space factor. It is best defined as
the ability to become aware of the spatial order or arrangement
of objects perceived visually.

Until the system of cognition factors was conceived, the writer
had thought of spatial orientation as a purely perceptual
ability rather than intellectual.⁵ Its place in the system is
regarded as tentative. We may yet find another seeing-patterns
factor in which figural properties play a more obvious role
than they do in the factor of spatial orientation. In a real
sense, an orientation within a field of perceived objects is a
pattern or system, where spatial arrangement, which includes
the viewer, is the principle. Shapes and sizes of objects,
which play a more obvious role in the case of the other figural
factors, are of more indirect significance in spatial
orientation.

⁵ A perceptual factor is distinguished from an intellectual
factor by the fact that no symbolic activity is clearly
involved.

Under the conceptual category we find a factor that has been
most difficult to define. The best conception of it is that it
represents an ability to define or structure problems. It has
been a most consistent component of arithmetic-reasoning tests,
but since such tests are psychologically complex, it has been
difficult to determine just what aspect of solving problems of
this type is the significant feature that requires the ability
called general reasoning. By elimination of many rival
hypotheses, it is now rather clear that the factor pertains to
the comprehension of the structure of a problem, at least of
the arithmetical variety (19). Since such a structure is
conceptual, the factor logically belongs in the column where it
is placed in Table 1. The Ship Destination test is a special
type of arithmetical-reasoning test, which seems to come closer
than any other to being a pure measure of the factor.
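
The two letter-based items quoted earlier for the structural-relations factor (Seeing Trends II and Correlate Completion II) can be mimicked mechanically, which makes plain that word meaning plays no part in them. The sketch below is illustrative only. The function names are hypothetical, and the reversal rule for the Correlate Completion II item is an assumption inferred from the pairs “am ma” and “not ton”; the report does not state the keyed answer.

```python
def initials_in_alphabetical_sequence(words):
    """True when the first letters of successive words are consecutive letters."""
    initials = [word[0].lower() for word in words]
    return all(ord(b) - ord(a) == 1 for a, b in zip(initials, initials[1:]))


def reverse_letters(word):
    """Correlate under the assumed rule: the same letters in reverse order."""
    return word[::-1]


if __name__ == "__main__":
    # Seeing Trends II item from the text: the trend lies in the initial letters.
    trend_item = ["anger", "bacteria", "camel", "dead", "excite"]
    print(initials_in_alphabetical_sequence(trend_item))

    # Correlate Completion II item: check the assumed rule against the
    # two given pairs, then apply it to the incomplete third pair.
    given_pairs = [("am", "ma"), ("not", "ton")]
    print(all(reverse_letters(first) == second for first, second in given_pairs))
    print(reverse_letters("tool"))
```

Neither function consults a dictionary or any information about meaning, which is the sense in which such material is structural rather than conceptual.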

In the next row of Table 1, for the discovery of problems,
there is only one factor, sensitivity to problems, which is in
the conceptual column. The appearance of this factor parallel
to general reasoning in the row preceding, emphasizes the
well-known observation that it is one thing to be aware that a
problem exists and another thing to be aware of the nature of
the problem. The titles of the tests are quite descriptive. A
sample item from the test Seeing Problems asks the examinee to
list as many as five problems in connection with a common
object like a candle. The test Seeing Deficiencies presents in
each item the general plan for solving a given problem, but the
plan raises some new problems. What are those problems?

Whether we shall ever find parallel factors for seeing problems
or deficiencies of figural and structural types remains to be
seen. Problems of a figural type are faced in aesthetic
pursuits such as painting and architecture. Problems of a
structural type might be faced in connection with spelling or
the development of language. Tests pertaining to the seeing of
problems have thus far provided no figural or structural bases
for problems. It should be relatively easy to test the
hypothesis that such factors exist. If they do exist, their
possible implications for everyday performance need further
study.

In the investigation of planning abilities (14, 15), two
parallel factors were found, perceptual foresight and
conceptual foresight, where one was expected. The Competitive
Planning test was originally designed by the AAF psychologists
as a test of foresight and planning (21). It requires the
examinee to imagine that he is playing the game of completing
squares by drawing lines. He plays for the two opponents and in
each item he has to tell the maximum number of squares each
opponent can complete under the rules of the game. The Route
Planning test, another AAF product, is a type of maze problem.
The examinee must say which of alternative points will have to
be passed through in going from the starting point to the goal.
In both tests, perceived layouts are used.

The test Pertinent Questions presents in each item a need for a
decision and the examinee is asked to state what facts he
should consider in reaching a decision. For example, a new
graduate is offered positions in two different cities. What
should be the deciding considerations? In the Alternate Methods
test, a practical problem is given, with available objects that
may be used. The examinee is to give several alternative
solutions that he considers most adequate.

Porteus has maintained that his series of maze tests measure
foresight. He can well claim support from the factor-analysis
results just mentioned. The type of foresight measured by maze
tests, however, is of a concrete variety. This ability may be
important for the architect, the engineer, and the
industrial-layout planner. It may not be found related to the
abstract type of planning that we find in the political
strategist and the policy maker. So far as our results go, the
maze test should by no means be offered as a test of general
intelligence. This statement might need modification, however,
after the maze test is factor analyzed in a population of lower
general intellectual level where general intelligence is
defined operationally as an average of all intellectual
abilities. In a population of “high-level personnel,” we can
say that a maze test measures most strongly the factor of
perceptual foresight and, incidentally, to some degree the
factors of visualization and adaptive flexibility (18).

The appearance of a factor called penetration in the last
column of Table 1, along with conceptual foresight, calls for
comment. A factor of penetration was hypothesized in the first
analysis of creative abilities and was not found (31). An
unidentified factor found there might well have been
penetration. A factor has been so identified in a more recent
analysis that emphasized creative ability tests (20). It is
strongly loaded on a test called Social Institutions, which
asks what is wrong with well-known institutions such as
tipping. It was designed as a test of sensitivity to problems,
and it has consistently had a loading on that factor. In the
first creativity analysis, two scores were based upon this
test; one being the total number of low-quality or obvious
defects and the other was the total number of high-quality or
“penetrating” defects, defects that can be seen only by the
far-sighted person. As a matter of fact, the two scores had
much to do with effecting a separation of the seeing-problems
tests into two groups, one of which might have been identified
as the penetration factor.

It is quite possible that the factor of penetration and the
factor of conceptual foresight are one and the same. They came
out in two different analyses that had no crucial tests in
common. It would be a good hypothesis that they are identical
and a good prediction would be that if the four tests listed in
Table 1 were all analyzed in the same battery they would define
a single factor, not two.

There is the apparent possibility for the existence of a
foresight factor involving structural arrangements, but the
scope and usefulness of such a factor would seem to be
questionable.


Production Factors—Convergent Thinking

The second large group of thinking factors has to do with the
production of some end result. After one has comprehended the
situation, or the significant aspects of it at the moment,
usually something needs to be done to it or about it. In the
analogies test, for example, having seen the relation between
the first pair of elements of an item we must then find a
correlate to complete another pair. Having understood a
problem, we must take further steps to solve it.

Like the cognition factors, the production factors show some
promise of falling under the rubrics of figural, structural,
and conceptual, but here the picture is less complete. The
kinds of things produced are more numerous than the kinds
discovered. There are no identities of things in the two lists,
but there are a few parallels or relationships. For example,
corresponding to the comprehension of words, there are factors
concerned with the production of words; corresponding to the
discovery of classes there is the act of naming; corresponding
to the discovery of relations there is the production of
correlates; and corresponding to the discovery of systems there
is the production of order. But with these few instances, the
connections and parallels seem to end.

It was announced earlier that the production factors fall into
two groups: convergent-thinking factors and divergent-thinking
factors. Such a distinction seems not to have been emphasized
in prior literature on thinking. In the case of some of the
production factors, the distinction is not complete, but in
most cases it is striking.

In convergent thinking, there is usually one conclusion or
answer that is regarded as unique, and thinking is channeled or
controlled in the direction of that answer. In tests of the
convergent-thinking factors, there is one keyed answer to each
item. Multiple-choice tests are well adapted to the measurement
of these abilities. In divergent thinking, on the other hand,
there is much searching or going off in various directions.
This is most clearly seen when there is no unique conclusion.
For the measurement of such abilities, completion tests are
almost a necessity. The distinction is not so clear in some
problem-solving tests, in which there must be and usually is
some divergent thinking or search as well as ultimate
convergence toward the solution. But the processes are
logically and operationally separable, even in such activities.
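
The operational side of the distinction can be made concrete with a small sketch of how the two kinds of item might be scored. It is a minimal illustration under stated assumptions, not the scoring procedure of the project: the function names, the item labels, and the sample responses are all hypothetical, and counting distinct acceptable responses is only one assumed way of scoring a completion item that has no unique answer.

```python
def score_convergent(responses, key):
    """One point for each item whose response matches its single keyed answer."""
    return sum(1 for item, answer in key.items() if responses.get(item) == answer)


def score_divergent(responses, acceptable):
    """Count the distinct responses that fall in a set judged acceptable.

    Assumed scoring rule for illustration: duplicates (ignoring case and
    surrounding spaces) count once, and unacceptable responses count zero.
    """
    distinct = {response.strip().lower() for response in responses}
    return len(distinct & acceptable)


if __name__ == "__main__":
    # Hypothetical multiple-choice record: two items, one keyed answer each.
    key = {"item_1": "C", "item_2": "A"}
    record = {"item_1": "C", "item_2": "B"}
    print(score_convergent(record, key))

    # Hypothetical completion item with several acceptable answers.
    acceptable = {"doorstop", "paperweight", "bookend"}
    written = ["Doorstop", "doorstop ", "paperweight", "sandwich"]
    print(score_divergent(written, acceptable))
```

The first function needs only a key, which is why multiple-choice forms serve convergent tests well; the second needs a judgment about which of many free responses are acceptable, which is why completion forms are called for.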

In Table 2 we have those produe- 
tion factors identified as dealing with 
convergent thinking. There are five 
potential triads of factors, depending 
upon the kind of result produced 
names, correlates, orders, changes, or 
unique conclusions. In two 
structural-tyvpe tests have figured in 
factors, thus a three-column matrix 
has been again adopted. 

In the first row are factors having 
to do with the production of names. 
The two factors there are 
trasted in terms of the 
abstract dichotomy. They differ, 
also, bv the fact that the one has to 
do with the naming of particulars 
while the other has to do with the 
naming of classes. French (8) lists a 
factor of naming, which been 
called object naming here to distin- 
guish it from the factor of abstraction 
naming, which was just recently dis- 
covered. The appearance of a 
of Color Naming under the rubric 
of “figural’’ calls for broadening the 


cases 


again con- 


concrete- 


has 


test 


conception of this class to recognize 
color as a figural property. Classes ot 
objects distinguished for their strue 
tural properties are evidentls 
very common. If good examples can 
be found, we may find a third nam- 
ing factor. In the name of the factor 
of abstraction naming, the term “ab 
straction” may prove to be too com 
The two. illustrative 


hot 


prehensive. 


tests mentioned might suggest that 
the ability is restricted to the nam- 
The results show that 
it is actually broader than that, since 


ing of classes. 
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rABLE 2 


PRODUCTION FACTORS—CONVERGENT THINKING 


Ty pe of result 


produc ed =. 
I Figural 
Names 


Object naming 
Form Naming 


Color Naming 


Correlates 


Type of Content 


Structural 


Conceptual 


Abstraction naming 

Picture-Group Nam 
ing 

Word-Group Naming 


Eduction of correlate 


Correlate Completion 
Figure Analogies Completion 


Visualization 


Spatial Visualization 


Pun hed Holes 


Form Reasoning 


it pertains to the naming of relations 
also, in other tests. 

With three factors having to do 
with the seeing of relationships, we 
might well expect three correspond- 
ing factors concerned with the educ- 
tion of correlates. As a matter of 
fact, the project has for some time 
anticipated at least two such factors, 
perceptual and conceptual, and has 
designed tests that were expected to 
effect the expected separation. To 
this date, eduction-of- 
correlates factor has been clearly in- 
dicated, and both figural and struc- 
tural tests have loadings on it. The 
Verbal Analogies Completion test, 
which we hoped would help to dis- 
tinguish a conceptual-correlates fac- 
tor, turned out to be a test of expres- 
sional fluency. Evidently the educ- 
tion-of-correlates aspect of the test 
was made so easy that little variance 
in this ability, if it is separate, was 


only one 


Ordering 

Picture Arrangement 

Sentence Order 

Re definition 

Gestalt Transforma- 
tion 

Object Synthesis 


Numerical facility Symbol manipulation 


Numerical Opera- Symbol Manipulation 


tions 
Sign Changes IT 


manifested. On the other hand, hav- 
ing educed the correlate, thinking of 
the needed word provided the chief 
individual differences in 
scores, and hence the loading on ex- 
pressional fluency. It can be pre- 
dicted that with the appropriate 
tests, three eduction-of-correlates fac- 
tors will become evident. Because of 
the difficulty of separating them, it 
can be predicted that the intercor- 
relations of these three factors will be 
found to be substantial. 

In the investigation of planning 
abilities it was hypothesized that 
there would be an ability to see or to 
appreciate order or the lack of it, as 
a feature of preparation for planning. 
It was also hypothesized that there 
would be an ability to produce order 
among objects, ideas, or events, in 
the production of a plan. A single 
ordering factor was found. Since the 
three tests designed to measure sensi- 


basis for 
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tivity to order had low and insignit- 
cant loadings on the factor, while the 
three designed to measure the pro- 
duction of order had significant and 
even substantial loadings, the factor 
seems to belong among the produc- 
tion factors. The Picture Arrange- 
ment test presents a flour-part car- 
toon strip in which the parts are out 
ol correct temporal order. The ex- 
aminee has to state the best order. 
The Sentence Order test presents in 
each item three sentences, each stat- 
the examinee 
told to rearrange them. 

It remains to. be 
ordering in terms of figural and 
structural properties will) call for 
additional ordering factors to help 
complete the matrix of Table 2. 
Figural ordering mav be a significant 
aspect of pictorial art. It 
to see where a structural order- 


ing an event, being 


seen whether 


is not so 
easy 
ing would be of consequence. 

In the next row of Table 2 we find the factor of visualization, which
has been known for some time, and the factor of redefinition, which was
found originally in the first creativity analysis (31). The thing
produced in both instances is some kind of change or rearrangement or
shift. The Spatial Visualization test is Part VI of the
Guilford-Zimmerman Aptitude Survey. In each item certain movements of a
pictured alarm clock are indicated and the examinee is to select the
view that would be seen after the movements. The Thurstone Punched
Holes test shows a paper being folded and a hole or holes then cut out.
The examinee is to tell how the paper would look after unfolding.

The redefinition factor involves shifts of meaning or use of objects or
parts of objects. The test Gestalt Transformation asks such questions
as: With which of the following objects could one best start a fire:
A. fountain pen, B. onion, C. pocket watch, D. light bulb, E. bowling
ball? The keyed answer is C, since the crystal can be transformed from
a face cover to a condensing lens. The Object Synthesis test asks such
questions as: Given pliers and a shoestring, what could you make? A
good answer would be "pendulum" or "plumb bob." In either case the
objects play new roles in the combination.

The last row of factors in Table 2 presents an interesting triad.
Although there are one or two questions that can be raised about their
placement, to be mentioned later, it is quite clear that they all
involve rigorous operations with symbols leading to unique conclusions.
The factor of numerical facility is the very well-known ability to
operate with numbers, where both speed and accuracy are significant.
The two new factors, symbol substitution and symbol manipulation, were
regarded as one factor until recently. In one analysis the factor
looked like a substitution ability, and in another analysis it looked
like a manipulation ability. In a recent analysis (20) the two were
found to be separate.

To distinguish these factors, we must consider the different kinds of
tests that represent the two. In Sign Changes, the examinee is told
before each block of items what interchanges to make in algebraic
signs, for example, "replace [sign illegible] with ×" and "replace +
with −." He applies the new rules to several simple equations such as
"3−6=?" and "64+2=?." In the Form Reasoning test, equations are stated
in the form of combinations of simple geometric forms. Some definitions
are first given, stating that a combination of two forms, such as a
star and a circle, can be replaced by another single form, a square.
With these substitutions of single forms for pairs, combinations
greater than pairs must be reduced to single symbols, taking each pair
in turn.
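
The Sign Changes task can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `apply_sign_changes`, the restriction to items of the form "a sign b," the ASCII spellings of the signs, and the sample items are all hypothetical and are not taken from the test itself.

```python
import operator

# Usual arithmetic meaning of each sign (hypothetical ASCII spellings).
OPERATIONS = {
    "+": operator.add,
    "-": operator.sub,
    "*": operator.mul,
    "/": operator.truediv,
}


def apply_sign_changes(left, sign, right, interchanges):
    """Evaluate 'left sign right' after applying the announced interchanges.

    `interchanges` maps a printed sign to the sign whose operation the
    examinee must carry out instead; signs not listed keep their usual
    meaning.
    """
    effective_sign = interchanges.get(sign, sign)
    return OPERATIONS[effective_sign](left, right)


# Assumed rule for one block of items: interchange + and -.
rule = {"+": "-", "-": "+"}
answers = [
    apply_sign_changes(3, "-", 6, rule),  # carried out as 3 + 6
    apply_sign_changes(6, "+", 2, rule),  # carried out as 6 - 2
    apply_sign_changes(4, "*", 5, rule),  # sign not in the rule: 4 * 5
]
print(answers)
```

The point of the sketch is that the printed sign serves only to identify an operation; what changes from block to block is the operation attached to it.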

It is difficult to accept fully the placement of symbol substitution in
the figural column. If all tests loaded on it were like Form Reasoning,
where the rigorous definitions and operations are all in terms of
figures, the placement would be quite reasonable. But certain features
of the Sign Changes test suggest that it is not figural properties, as
such, that are important. They may serve merely to identify the
symbols. In the Sign Changes test it is the operation that the symbol
stands for that is important. The Sign Changes test was originally
designed as a flexibility test; the Form Reasoning test was not. In
both, the readiness to switch the meaning or significance of symbols is
the obvious peculiar feature. Perhaps the emphasis should be placed on
the word "switch." It may be that this factor will eventually be placed
in the family of flexibility factors, which appears in Table 3. There
is no evidence against the hypothesis that symbol substitution is the
same as the present factor of adaptive flexibility, represented
particularly by the Match Problems test. As a matter of fact, Sign
Changes had a significant loading on adaptive flexibility in the
creativity analysis (31). Form Reasoning has never had an opportunity
to show such a loading.


TABLE 3
PRODUCTION FACTORS: DIVERGENT THINKING

                                      Type of Content
Row                  Figural                 Structural                Conceptual

1                                            Word fluency              Associational fluency
                                               Prefixes                  Controlled Associations II
                                                                         Associations III

2                                                                      Ideational fluency
                                                                         Plot Titles
                                                                         Consequences

3                                                                      Expressional fluency
                                                                         Vocabulary Completion

4                    Flexibility of closure  Adaptive flexibility      Spontaneous flexibility
                       Hidden Pictures         Match Problems            Brick Uses
                       Gottschaldt A           Planning Air Maneuvers    Unusual Uses

5 (Novel responses)                                                    Originality
                                                                         Plot Titles (cleverness)
                                                                         Symbol Production

6 (Details)          Elaboration*                                      Elaboration*
                       Figure Production                                 Planning Elaboration

* At present regarded as the same factor [text illegible] two separate factors.


Defining the factor of symbol manipulation are the two tests Symbol
Manipulation and Sign Changes II. Symbol Manipulation provides some
simply defined symbols, such as: E means equal to; NG means not greater
than. Each item then provides a statement such as: xEy and yNGz; which
of the following statements can logically be made: xSz, xNGz, etc. This
test was designed originally for the factor of logical evaluation (see
Table 4), and has usually shown some relationship to that factor, but
it also helps to define the factor of symbol manipulation.
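
An item of the Symbol Manipulation type can be checked mechanically, as in the following Python sketch. It is offered only as an illustration. E and NG are defined as in the text; the readings of S as "smaller than" and G as "greater than," the function names, and the use of small whole numbers as stand-ins for x, y, and z are assumptions made for the example.

```python
from itertools import product

# E and NG follow the text; S ("smaller than") and G ("greater than")
# are assumed readings added for illustration.
RELATIONS = {
    "E": lambda a, b: a == b,
    "NG": lambda a, b: a <= b,
    "S": lambda a, b: a < b,
    "G": lambda a, b: a > b,
}


def holds(statement, values):
    """Check one statement such as ('x', 'NG', 'z') for given values."""
    first, relation, second = statement
    return RELATIONS[relation](values[first], values[second])


def follows(premises, conclusion, variables=("x", "y", "z")):
    """True if the conclusion holds in every case satisfying the premises.

    Three distinct values are enough to realise every ordering of three
    variables, so the search over range(3) is exhaustive for such items.
    """
    for combo in product(range(len(variables)), repeat=len(variables)):
        values = dict(zip(variables, combo))
        premises_hold = all(holds(p, values) for p in premises)
        if premises_hold and not holds(conclusion, values):
            return False
    return True


premises = [("x", "E", "y"), ("y", "NG", "z")]
for candidate in [("x", "S", "z"), ("x", "NG", "z"), ("x", "G", "z")]:
    print(candidate, follows(premises, candidate))
```

Under these assumptions only xNGz can logically be made from the two given statements; xSz fails because x may equal z.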

The test Sign Changes II presents simple "equations" such as
1+2=4×1, the two sides of which are not actually equal as the statement
stands. The examinee is to say what interchange of algebraic signs will
make the equation correct. In the illustration just given, if × and −
are interchanged the equation will balance.
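
The Sign Changes II item just described can be solved by trying each possible interchange in turn. The following Python sketch does so for the illustration 1+2=4×1. The function names, the tuple representation of each side, and the limitation to one sign per side are assumptions made for the example, not features of the test.

```python
from itertools import combinations
import operator

OPERATIONS = {
    "+": operator.add,
    "-": operator.sub,
    "*": operator.mul,
    "/": operator.truediv,
}


def evaluate(side, swap):
    """Evaluate one side (a, sign, b) with the two signs in `swap` interchanged."""
    a, sign, b = side
    first, second = swap
    if sign == first:
        sign = second
    elif sign == second:
        sign = first
    return OPERATIONS[sign](a, b)


def balancing_interchanges(left, right):
    """Return every pair of signs whose interchange makes the sides equal."""
    solutions = []
    for swap in combinations(OPERATIONS, 2):
        try:
            if evaluate(left, swap) == evaluate(right, swap):
                solutions.append(swap)
        except ZeroDivisionError:
            continue  # an interchange forcing division by zero cannot balance
    return solutions


# The illustration from the text: 1 + 2 = 4 x 1.
print(balancing_interchanges((1, "+", 2), (4, "*", 1)))
```

For this item the only balancing interchange is that of − and ×, which turns the right side into 4 − 1 and leaves the left side at 1 + 2.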


From these two tests alone, it is not easy to see exactly what kind of
ability is involved in common. One clue may be that both tests involve
equations. A third test with a significant loading in one analysis is a
number-series test. This test does not involve equations. In one
analysis the numerical-facility factor was distinct from symbol
manipulation; consequently we cannot identify the latter with the
former. Further intensive work is obviously needed in the area of these
factors. Abilities that may be of some significance for success in
mathematics may be found here.


Production Factors: Divergent Thinking

The divergent-thinking factors are 
arranged in a matrix in Table 3, 
with the three column categories that 
have now become familiar. Here 
there are more vacancies to be filled, 
if the system is indeed as applicable 
as it promises to be. 

TABLE 4
EVALUATION FACTORS

                           Type of Content
Figural                      Structural      Conceptual

(Perceptual evaluation)*                     Logical evaluation
  Ratio Estimation                             Logical Reasoning
  Figure Estimation                            Inferences

Length estimation                            Experiential evaluation
  Pattern Assembly                             Unusual Details
  Shorter Path
                                             Judgment
                                               Practical Judgment
                                               Practical Estimation

Speed of judgment (shown across the content columns)
  Color-Form Sort Time
  Social Judgments Time

* Probably a composite of factors, including length estimation.


In the first three rows of Table 3 we find the four well-established
fluency factors. In the first row are the two fluency factors having to
do with the production of single words. In the case of the factor of
word fluency, meaning is of no importance. The usual tests of this
factor merely specify that the words shall begin or end with a
specified letter, prefix, or suffix. Only such structural requirements
are to be met. The examinee need not even know the meanings of the
words he gives. In the case of associational fluency, however, meaning
is an essential requirement. The words given must be synonyms, as in
Controlled Associations II, or must be related in some meaningful way
to stimulus words or ideas. In Controlled Associations II, the examinee
gives as many as three synonyms to each stimulus word. In Associations
III, two words are given, differing in meaning, and the examinee must
give one word that is a synonym to both. For example, the word "lie"
would be given as a synonym to both "recline" and "deceive."

It does not seem very likely that an ability will be found for the
first cell in Row 1 of the table. This would call for the production of
words satisfying specified figural requirements. Yet, tasks can be
thought of to meet this case, for example, the writing of headlines,
the production of esthetic effects with words, and so on. It does not
seem likely, however, that there should have developed in human makeup
a unitary ability of this kind.

The second row of the table offers some interesting possibilities. The
speed of calling up ideas expressible in verbal form can be tested by
different kinds of tasks. The two examples of tests given were designed
for the study of creativity. The Plot Titles test of fluency is scored
by the total number of low-quality titles that can be suggested for a
short story plot in a given time. The Consequences test is scored
similarly, but the responses are consequences foreseen as a result of
some drastic change, such as everyone going blind.

It can well be questioned whether 
fluency of verbal responses of such 
kinds is strongly related to fluency 
of ideas of a mechanical, or musical, 
or pictorial kind. Fluency tests have 
been commonly cast in verbal form. 
Fluency in the production of figures 
and fluency in the production of 
things distinguished by their struc- 
tural properties may well be separate 
factors, both distinct from the idea-
tional-fluency factor now known. The
exploration of such possibilities would 
seem to be a fruitful route to take in 
the investigation of creativity. 

The separateness of the factor expressional fluency from ideational
fluency indicates that the ability to have ideas and the ability to put
them into words are different things. Since the examinee must state
verbally his ideas in tests of ideational fluency, it might be supposed
that his ability to express himself is included or is also being
tested. But apparently in such a test the expressional problem is not a
serious one. We present other tests in which the idea is given and the
examinee must put it into words, usually in more than one way. The
expressional problem is then more difficult, the test giving us
variance in the expressional factor. In the Vocabulary Completion test,
a stimulus word is used in a brief context, enough to indicate its
meaning, and the examinee has to give the word. In the Similes test,
the examinee must give more than one completion to a simile. In a
Verbal Analogies Completion test, which was designed to measure another
factor, we found that the leading variance is in the
expressional-fluency factor.

The only complete triad in Table 3 is a set of flexibility factors, the
best-known of which is adaptive flexibility. The three factors involved
are not clearly parallel in all respects. They have in common the
feature that sudden shifts of activity occur: shift of organization of
a figure, shift of set or approach to a problem, or shift of category
of responses, respectively. Thurstone discovered the
flexibility-of-closure factor in his analysis of perception (28) and
found that the factor had relations indicating its intellectual
importance.

The most consistently representative test of the factor of adaptive
flexibility is the Match Problems test. This test is based upon the
old, familiar puzzle or game of removing a specified number of match
sticks in order to leave a specified number of squares. In order to
measure flexibility, the problem changes drastically from one item to
the next, requiring very unusual solutions; solutions such as the
average person would not expect. For example, at first the examinee is
led to expect that the remaining squares will be of the same size, but
there comes an item in which they must be of unequal size. Another item
requires that a smaller square be left within a larger one, and so on.

In an unpublished study, a test involving Gottschaldt figures came out
as strongly loaded on adaptive flexibility as did Match Problems. In
the same analysis, a test of Insight Puzzles also had a similar
loading. Thus, in this case, a perceptual, a structural, and a
conceptual test had strong loadings on the same factor. There is
therefore the possibility that flexibility of closure and adaptive
flexibility are one and the same factor and that this factor cuts
across all three columns of the matrix. In an analysis where
perceptual, structural, and conceptual flexibility tests are all
liberally represented, however, it can be predicted that three factors
will be found. If so, they are probably substantially intercorrelated.
If there are three such factors, the factor of spontaneous flexibility
would have to be moved to another row of the matrix to be replaced by a
conceptual-adaptive-flexibility factor.

The factor of spontaneous flexibility has appeared persistently but
never with great strength or stability. The Brick Uses test,
flexibility score, is the best clue to its nature. This score is the
number of runs of responses. The examinee is told to name all the uses
he can think of for a common brick, in eight minutes. A "run" of
responses is a sequence of uses all of the same class, such as the use
of bricks as building material or as missiles, and so on. The test
Unusual Uses calls for listing several unconventional uses for each of
a number of objects, the number given being the score. Since only
verbal tests of this factor have been analyzed, nothing can be said
regarding the possibility that there are parallel factors involving
figural and structural contents.
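
The scoring of runs described above can be illustrated with a short Python sketch. It assumes that each response has already been assigned to a class of use by a scorer; the function name `count_runs`, the class labels, and the sample protocol are hypothetical.

```python
from itertools import groupby


def count_runs(use_classes):
    """Count runs: maximal stretches of consecutive responses in one class."""
    return sum(1 for _ in groupby(use_classes))


# Hypothetical protocol: successive responses, each coded by class of use.
responses = ["building", "building", "missile", "weight", "weight", "building"]
print(count_runs(responses))
```

A return to an earlier class, as with the final "building" response here, begins a new run, so that an examinee who shifts class often earns a higher score than one who exhausts each class before moving on.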

It is of some interest to attempt to relate spontaneous flexibility to
other concepts in psychology. Essentially, it appears to be a
disposition to avoid repeating one's self. This suggests a relation to
Thorndike's concept of refractory phase or to Hull's concept of
reactive inhibition. A hypothesis to be tested would be that tests
designed to measure individual differences in tendency to show
refractory phase of the Thorndikian type and tests to show degree of
tendency to reactive inhibition indicate the same factor as do tests of
spontaneous flexibility.

The results continue to show that originality is operationally
definable as the likelihood of giving unconventional, clever, or
remotely associated responses to test items (30). It is measurable in
terms of number of clever titles given to story plots, clever "punch
lines" for cartoons, remote consequences to events, and idiosyncratic
word associations. In two analyses there has been opportunity for a
cleverness factor to separate off from the rest, but this did not
occur. While the factor thus seems to be a rather broad one, it may
well be asked whether such a factor, measured only by means of verbal
tests, is significantly related to original production in nonverbal
activities such as graphic arts, music, or inventive engineering.

We have had only one originality test that is at least partly
nonverbal: the Symbol Production test. This test was designed for
another purpose, namely to test the hypothesis that there is a separate
ability to symbolize ideas in terms of simple line drawings. Each item
presents a statement, such as "ring the bell," of which the two
italicized words are to be represented by two symbols. The score is the
number of nouns and verbs symbolized in the testing time. The test is
not entirely nonverbal, of course, although the thing produced is
figural. There was a second test, Line Drawing, requiring the
production of line symbols for given adjectives, in the same battery
with the Symbol Production test. These two tests might have given rise
to a separate factor, but they did not. Nevertheless, the writer is of
the opinion that the problem of whether there are originality factors
peculiar to nonverbal areas is still an open one.

The elaboration factor is an ability to provide details working toward
completion, when a part or an outline is given. The test Planning
Elaboration presents the bare outline of a plan to which details must
be added to make it effective. In the Figure Production test, a simple
line is given, to which the examinee is asked to add lines to complete
an object. The score depends upon the amount of detail added.

Here we have a clearly verbal test 
and a clearly figural test (although a 
meaningful object is usually pro- 
duced) both with relation to the same 
factor. There is still the possibility 
that there are two (or three) elabora- 
tion factors, distinguished in terms 
of content, with enough relationship 
between them to cause the factors to 
appear to be one. It will take a new analysis, including at least three
good figure-elaboration tests and three good verbal-elaboration tests
(not to forget a triad of structural-elaboration tests, also), to
determine how many elaboration factors there are.

Considering the factors in the divergent-thinking category together, it
is obvious that the freedom to change direction of thinking varies
considerably from one instance to another. Different degrees of
situation-imposed restriction are involved. But generally, within
whatever limits are imposed by external restrictions, the need for
rejecting or superseding a response and for trying or producing a new
one is the common element in this group of factors. There is also a
difference in the amount of self-imposed restriction or freedom. This
depends upon the individual rather than upon the situation. It is
largely in this source of variation that we find the divergent-thinking
factors.


Evaluation Factors 


Evaluation factors have to do with decisions concerning the goodness,
suitability, or effectiveness of the results of thinking. After a
discovery is made, after a product is achieved, is it correct, is it
the best that we can do, will it work? This calls for a judgmental step
of some kind. It was our hypothesis in the project that the ability to
make such decisions will depend upon the area
within which the thinking takes
place and the criteria on which the 
decision is based. The results indi- 
cate several evaluation factors. They 
have been placed in the customary 
three-column matrix in Table 4, in 
spite of the fact that none have been 
found to fit the structural column. 
In this group of factors there is no 
good way of distinguishing rows. 
The domain of evaluation factors 
has been less well explored than the 
other intellectual domains. 

The least that can be said is that 
the perceptual-conceptual dichotomy 
applies in this area of abilities. Al- 
though our analysis showed only one 
factor applying to judgments of
figural material, it is likely that in
this subarea of evaluation alone there
are a number of judgment factors.
For this reason the factor of per- 
ceptual evaluation has been placed in 
parentheses in Table 4. For ex- 
ample, a more restricted factor of 
length estimation has been found (21). 
The search for such factors carries us 
over into the whole realm of psycho- 
physical judgment. It would be dif-
ficult to say whether factors of this
kind belong under the general head- 
ing of thinking or under the heading 
of perception. In view of the known 
complexity of psychophysical judg- 
ments in general, their place in the 
intellectual group can be defended. 

The best established evaluation 
factor is that of logical evaluation. 
This is defined as the ability to 
judge the soundness of conclusions 
where logical consistency is the cri- 
terion. The factor has sometimes 
been called “deduction,” with the 
belief that it is the ability to draw
conclusions logically consistent with
premises. If this were the case, the
factor would belong with the production-factors
group. Most tests in
which the factor has been found to be
a component are of the true-false or
multiple-choice form, in which the
examinee is given conclusions; he
need not produce them. It is difficult
to say whether he actually does
produce them for himself first and then
finds them among the answers provided.
But whether he does this or
not, he must necessarily make a
judgment as to the correctness of the
answer, whether his own answer or the ones
given him. Even in a completion 
test, this step would be necessary. 
It seems preferable, therefore, to call 
the factor logical evaluation and to 
list it among the evaluation factors. 

It was hypothesized that there 
would be a factor in which evalua- 
tion is made on the basis of past experience.
Such a factor was found,
and it is represented best by the test 
of Unusual Details. In this test the 
examinee is asked essentially “What 
is wrong with this picture,” in which 
there are two features that are incon- 
gruous or inconsistent with common 
experience. In defining this factor, 
whether the emphasis should be 
placed upon the supply of past ex- 
perience or upon an ability to utilize 
that experience is not known. 

The factor called judgment is listed 
with some hesitation. It was found 
repeatedly, but rather weakly, in 
AAF research (21). It is best represented
by a test in which a practical
difficulty is described and several
alternative solutions are offered.
Which one is best, everything considered?
In common terminology,
the ability might be recognized as
wisdom or common sense. In the aptitudes-project
research, there is evidence
that this AAF judgment factor
may be the same as the one called
redefinition. If this is the case, it is
not easy to say where to place the 
emphasis in defining the factor. 

The factor speed of judgment was found by Thurstone in his analysis of
perceptual abilities (28). The speed with which the examinee completes
the sorting of objects according to color or form and the speed with
which he checks traits that apply to himself are both measures of the
factor. It is thus shown as cutting across the three content
categories. It might well be classed as a temperament trait rather than
an ability.


Memory Factors 


There is little doubt about the grouping of the remaining factors under
the heading of memory factors. Collecting all such factors from various
sources, we find that seven qualify for this category. A recent
analysis by Kelley (27) has done much to verify and complete the
picture for this group. It is possible to organize these factors in the
three columns of the now familiar categories as to content, and in
three rows as to the kind of thing or aspect
involved (see Table 5). The titles of
the tests representing each factor are 
usually quite descriptive. 

The best-known of the memory factors is rote memory, the ability to
learn and to remember things associated, where meaning is of little or
no importance. In the AAF research this factor was called "associative
memory" for the reason that paired-associate learning was typical of
the tests of it. There was a need, also, of distinguishing it from the
factor of visual memory, where sheer content is important rather than
associative connections between contents. Since Kelley (27) has
demonstrated another associative-memory factor in the form of
meaningful memory, however, it seems best to return to the name of rote
memory. The placement of both in an associative row of the matrix
indicates their common associative property. The vacancy under the
figural heading in this row calls for the hypothesis that there is an
undiscovered factor pertaining to the learning of associative
connections between figural contents.


TABLE 5
A MATRIX OF MEMORY FACTORS

                                          Type of Content
Thing or aspect
remembered               Figural                     Structural          Conceptual

Associative connections                              Rote memory         Meaningful memory
                                                       Word-Number         Sentence Completion
                                                       Color-Word          Related Words

[Content]                Visual memory                                   Memory for ideas
                           Reproduction of Designs                         Memory for Ideas
                           Map Memory                                      Limericks
                         Auditory memory
                           Musical Memory
                           Rhythm

Span                                                 Memory span         Integration I
                                                       Digit Span          Signal Interpretation
                                                       Letter Span         Combat Planes


The factor of visual memory has been known for some time (21). The
factor may be regarded as a rather photographic-memory ability. Some
individuals are recognized as standing out in this respect, for example
certain police officers who remember faces and motor-vehicle license
numbers remarkably well. In tests, the evidence of remembering of this
type may be in the form of reproductions (Reproduction of Designs
test), or recognition (an AAF Map Memory test), or verbal descriptions
(another AAF Map Memory test).

The listing of a factor with the name of auditory memory represents in
part the writer's somewhat risky hypothesis. It is based upon a factor
found by Karlin (26) in tests of musical memory (for melody and
rhythm). French (4) called it "musical memory," which is the cautious
thing to do. The name "auditory memory" used here implies some
confidence in the prediction that when nonmusical auditory-memory tests
are included with musical-memory tests in the same analysis, the same
factor will apply to both.

AAF research results hinted at the existence of a content-memory or
substance-memory factor but did not demonstrate it. Kelley's results
give evidence for such a factor. It is the memory for ideas, which are
probably not expressed verbatim in recall tests. Further support for
this factor is desirable. The hypothesis that there is a "content"
factor in the structural column is still to be investigated. It is not
easy to say what this would be like. The memory for a route might
qualify.

Memory-span tests, composed of digits and letters, have in common a
memory-span factor. This factor belongs in the structural column.
Incidentally, it is interesting that memory-span tests have been rather
popular components of general-intelligence scales. It turns out that
they measure primarily a rather special kind of memory ability whose
social importance cannot be very great. Telephone operators come to
mind first in this connection. A general remark may be made, prompted
by the emphasis upon memory-span tests as measures of intelligence,
that although many tests correlate highly with chronological age, this
does not ensure that they measure any very significant aspect of
intelligence.

In the conceptual column, Integration I, a factor found in AAF
research, is proposed as a memory-span factor. The tests Signal
Interpretation and Combat Planes require the examinee to keep in mind a
relatively large number of detailed rules for success in them. Kelley
(27) had one span test in which the content was in the nature of lists
of tasks to be done, the length varying as in digit and letter-span
tests. It came out with those other span tests on his memory-span
factor. It can be predicted that if there were other idea-span tests,
and perhaps some Integration-I tests in the battery, two span factors
would be found.⁶ The span factors are probably significantly
correlated. The vacant cell in Row 3 of Table 5 suggests that the way
is open for someone to see whether a third memory-span factor will be
found where the contents are figural.

⁶ Another hypothesis is tenable with regard to Integration I, however.
It might be identical with the factor memory for ideas.

To digress somewhat from an account of the factors, it may be pointed
out that the fact that there are several distinct memory abilities may
explain some of the phenomena observed in memory experiments,
particularly where results are discordant. Results from memory
experiments may differ markedly, sometimes, depending upon the kind of
material and the thing or aspect emphasized. For example, the relative
strength of backward vs. forward associations differs when the material
is composed of visual forms or is composed of syllables. In transfer
experiments, in view of the different abilities involved, it should not
be surprising that transfers of gains in memorizing skills should be so
limited. It would be interesting to test the hypothesis that transfer
will be relatively greater between tasks that depend upon the same
memory factor or upon the more strongly correlated factors. The same
hypothesis could be stated with respect to thinking factors and other
ability factors generally.


DISCUSSION 


The account of the known intellectual factors and the system into which
they seem to fall calls for the discussion of some general questions.
There are implications for factor theory and for its application to
psychological research in general. There are implications for general
psychological theory and for the practices of intelligence testing.


Implications for Factor Theory and Factor Analysis



A theory or a method should be judged by its fruits. If the results
that have been reported here contribute to psychological understanding
and, through that, to useful psychological practice, factor analysis
has passed this kind of test. The mathematical model that has been
applied, which conceives of individual differences in intellectual
performances as being represented by a coordinate system of n
dimensions, has served certain purposes. While
it may be shown at some future time
that the model is not the best that
could be applied, its power to gen- 
erate new psychological ideas and to 
extend considerably the conception 
of the realm of intellect has been dem- 
onstrated. 
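
The coordinate-system model may be made concrete with a small Python sketch. It assumes the usual linear form of the factor model, which the text does not spell out: a person is a point with one coordinate per factor, and his expected standing on a test is the sum of those coordinates weighted by the test's loadings. The function name and every number below are hypothetical.

```python
def predicted_score(loadings, factor_scores):
    """Weighted sum of a person's factor coordinates for one test.

    Assumes the linear factor model; unique and error components
    are left out of this sketch.
    """
    if len(loadings) != len(factor_scores):
        raise ValueError("one loading is needed per factor")
    return sum(a * f for a, f in zip(loadings, factor_scores))


# Hypothetical values: three factors, one person, two tests.
person = [1.0, -0.5, 2.0]   # coordinates on factors 1 to 3
test_a = [0.6, 0.0, 0.1]    # loadings dominated by factor 1
test_b = [0.0, 0.2, 0.7]    # loadings dominated by factor 3
print(predicted_score(test_a, person))
print(predicted_score(test_b, person))
```

Two tests with different loadings thus order the same persons differently, which is the sense in which a single composite score conceals the dimensions that contribute to it.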

The average reader will no doubt 
be surprised by the large number of 
dimensions that seem to be required 
to encompass the range of intellectual 
aspects of human nature. Some 40
factors are reported as being known
and a great many additional unknown
factors are forecast. This
would seem to go against the scientific
urge for parsimony.

The principle of parsimony has led us in the past to the extreme of one
intellectual dimension, which everyone should now regard as going too
far in that direction. There is actually no fixed criterion for the
satisfaction of the principle of parsimony. In science we can satisfy
the principle to some degree whenever the number of concepts is smaller
than the number of phenomena observed. Forty, sixty, or even a hundred
factors would certainly be a smaller number of concepts than the number
of possible tests or the number of observable types of activities of an
intellectual character. In this sense the principle of parsimony has
been satisfied.

The number of the factors is less
unattractive when we find that they
can be subsumed within a system
that is describable by a smaller
number of categories or principles, as
we have seen in the matrices of Tables
1-5. Some readers will ask whether,
since there are many probable
intercorrelations among the factors, a
small set of second-order factors will
not suffice. Granting that we can
make sufficiently accurate estimates
of the intercorrelations among the
factors, which the writer doubts that
we can do at present, to use only
second-order-factor concepts would lose
information. This follows from the
fact that where n linearly independent
dimensions are necessary to
describe a domain geometrically, no
one dimension can be entirely
accounted for by combinations of the
others.

It may be asked whether some of
the factors listed are not really
specific factors rather than common
factors. This is a legitimate question.
It is not uncommon experience in
factor analysis to find that what was
formerly regarded as a single common
factor appears later to split up into
two or more factors. The “splitting
up” description is not completely
accurate. It applies best to the fact
that a group of tests having a
“factor” in common later divide into two
or more groups each defining its own
common factor. In clear thinking
about this phenomenon, we must
keep in mind the distinction between
“factor” as a mathematical concept
and “factor” as a psychological
concept. The immediate results of a
factor analysis are in terms of
mathematical factors. Whether each
mathematical factor represents a single
psychological factor or a combination
of psychological factors has to
be determined by interpretation and
by further experimental work applied
to the designing of new factor
analyses. Eventually we reach the
stage where further efforts to “split”
a factor fail. Whether this has
brought us to a specific factor in any
particular case can be decided on the
basis of a single criterion. Are the
tests defining this factor essentially
just different forms of the same test?
This cannot always be decided with
certainty, but there is usually little
difficulty in doing so. If we suspect
that any factor is a specific, a new
analysis that includes more obviously
different tests, but tests that
should measure the same common
factor, should be done.

Skepticism was expressed above
concerning the operation of estimating
factor intercorrelations. This
is a somewhat complicated problem
for which there is as yet no good
solution. The common procedure in
vogue at the present time for
estimating factor intercorrelations is to
do an oblique rotation of axes,
locate the primary axes and determine
the cosines of their angles of
separation. The writer has preferred
orthogonal rotations for several
reasons. Briefly, any particular oblique
solution to a factor problem is a
function of several nonpsychological
circumstances. For one thing, it depends
upon the kind of population tested.
This is not so serious, but we should
probably have a different set of
factor intercorrelations for each age
group, educational level, cultural
milieu, etc., and for combinations of
these. This lack of invariance
precludes making very general
statements regarding the psychological
interdependencies of factors.

A more serious matter is that
oblique solutions depend upon the
population of tests that we factor
analyze. This is not merely a
sampling problem, for the collection of
tests in any battery is never a
randomly selected one, and should
certainly not be. Much of this difficulty
hinges on inadequacies of test
construction and test administration.
Rarely do we succeed well enough,
either by test construction or by test
administration, in exerting the
experimental controls it would take to
come out with a score that is a pure
measure of a factor. If two factors
happen to be commonly loaded in the
tests that define both of them, it
would give the appearance of a
factor intercorrelation whether there
was genuine correlation or not. This
kind of result is not uncommon. Until
we succeed in exerting better
experimental controls in testing, we shall
not have a very good basis for
estimating factor intercorrelations, even
for a specified population of
examinees.

The question always comes up
regarding the origins of factors; are
they inherited or are they acquired,
to use the common, loose expression
of this question. The reply is that
factor analysis alone cannot answer
this question. So far as factor
analysis is concerned, the factors could all
be hereditary in origin, or all
environmental, all some weighted
combination of both heredity and
environment, or some due to the one and
some to the other source. It will
take experimental work of the usual
types to answer this question. But
one thing is clear. The question “Is
intelligence inherited or is it
acquired” makes less sense than it ever
did. Such a question must be asked
regarding each and every factor.
Ferguson (4) has expressed the
interesting hypothesis that the
factors are a consequence of the
principles of transfer of learning.
Many of them may be, to a large
extent. The Ferguson hypothesis is akin
to a similar one expressed earlier in
this paper.

In connection with origins of
factors, there is also the question of
when in child development the
factors make their appearances. To
the extent that factors are developed
by experience, they would appear at
such ages as the effects of experience
have sufficiently crystallized. To the
extent that heredity is chiefly
responsible for the differentiation of
factors, their appearances should be
detectable when maturation effects
their differentiation. In either case,
the answer is to be determined by
experimental testing and factor
analysis at all age levels at which
suitable tests can be administered. Such
analyses should be done in populations
very homogeneous with respect to 
age and other features. It can be 
predicted that the structure of the 
intellectual factors for children will 
be found simpler than that for adults. 
It can also be hypothesized that the 
structure for generally superior adults 
will be found more complex than for 
generally inferior adults. 
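
The operation of estimating factor intercorrelations described earlier in
this section, in which the cosines of the angles separating the primary
axes are taken as the correlations among factors, can be illustrated with
a minimal sketch. The axis coordinates below are hypothetical and are
chosen only for illustration; they are not taken from any analysis
reported here.

```python
import math


def cosine(u, v):
    """Return the cosine of the angle between two vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    length_u = math.sqrt(sum(a * a for a in u))
    length_v = math.sqrt(sum(b * b for b in v))
    return dot / (length_u * length_v)


def factor_intercorrelations(primary_axes):
    """Return the matrix of cosines between all pairs of primary axes."""
    return [[cosine(u, v) for v in primary_axes] for u in primary_axes]


# Hypothetical primary axes, given as coordinates in an orthogonal
# reference frame. These values are illustrative assumptions only.
orthogonal_axes = [(1.0, 0.0), (0.0, 1.0)]
oblique_axes = [(1.0, 0.0), (0.8, 0.6)]

if __name__ == "__main__":
    for label, axes in (("orthogonal", orthogonal_axes),
                        ("oblique", oblique_axes)):
        matrix = factor_intercorrelations(axes)
        # The off-diagonal entry is the estimated factor intercorrelation.
        print(label, round(matrix[0][1], 3))
```

In this sketch the orthogonal pair yields an intercorrelation of zero and
the oblique pair one of about .8. The estimate rests entirely on where the
primary axes happen to be located, which is why it changes with the
population tested and with the tests included in the battery.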


Implications for Psychological Theory 


It was suggested earlier that
although psychological factors are
variables among individual differences
they also indicate psychological
functions within individuals. It is
therefore in order to take the factors
seriously as starting points for
psychological theory.

There has never been developed a
comprehensive theory of thinking.
We have been short of the essential
concepts needed in the construction
of such a theory. In view of the great
variety of thinking abilities (and
functions) revealed by factor
analysis, the time-honored concepts of
reasoning, induction, deduction, and
the like appear even more inadequate
than before. It seems to be of little
value to attempt to relate the factors
to those categories. The factors,
instead, have generated their own
categories, which have been already
presented. They are essentially
operational concepts, since, like factors,
they refer back to the kinds of tests
from which factor definitions were
inferred.

Although the general picture of the
thinking factors is not yet sufficiently
complete or certain to suggest an
obvious, general theory of thinking,
the kind of theory that they will
eventually generate can be seen.

It is fairly well agreed that
thinking is symbolic behavior. It is not
surprising, then, that certain factors
have to do with symbols, as such, and
with their utilization and
manipulation. Of all the kinds of symbols
available to humans in almost any
culture, words and numbers are
among those of greatest importance.
The factors reflect these facts.

In the operations of thinking, of
realistic thinking, in particular, the
factors indicate the important steps
or processes of discovery,
production, and evaluation, often occurring
roughly in that temporal order.
Divergent thinking may come into the
picture along with these other phases,
and auxiliary to them, particularly
when they proceed with some
difficulty. Some divergent-thinking
processes are also likely to occur in
nonrealistic thinking, when one is simply
free to do so and finds it rewarding.
Since realistic thinking is usually
convergent, particularly when there
is one right answer, at times there
may be conflicting divergent-convergent
tendencies, a phenomenon
that has not been reported, to the
knowledge of the writer.

Quite generally, it seems, the
thinking processes of a person may
proceed more or less ably depending
upon the kind of content with which
he is involved: perceived figures,
recognized structures, or conceived
meanings. The distinction that has
sometimes been made between
concrete thinking and abstract thinking
has foreshadowed the major
distinction here; the distinction between
figural factors and conceptual
factors. The appearance of the third
category, structural, came as a
surprise. If it turns out to be important,
we have several interesting
implications.

One practical implication of the
structural category is that tests
based upon letter material and the
like may be of limited significance,
if in reality we are interested in
predicting behavior that depends upon
factors in the figural or conceptual
columns. A more important
implication has to do with the fact that
there is a shortage of known factors
in the structural column. A rather
direct reason for this may be that
there has been a bias toward figural
and verbal test material, with an
unfortunate slighting of structural
material. This would not be so
unfortunate if it turns out that in our
civilization not many such factors
exist, or if they do exist they are of
relatively little social importance. It
may be that there is actually more
structural-type thinking going on
than we realize and that both
psychologists and educators have failed
properly to recognize it. In a highly
technical age, such thinking would
seem to be important. We might well
ask ourselves whether we have
overlooked something of importance in
this general area.


The headings of rows in Tables 1-3
present an unusual list of concepts,
which appear to be more
epistemological than psychological. Is this
possibly the kind of concepts that
we have needed? It may be possible
to give some of them more
psychological terminology later, but at
present they refer to the kinds of
things that we can know and can
produce. If such terminology
describes behavior in a significant and
useful manner, it should be
welcomed and its worth should be
recognized. One implication is that the
lists seem to be open to new
additions. Consideration of what
categories might be added to the lists
might turn up some fruitful new
hypotheses regarding unknown
factors and functions.

The subject of problem solving
has come into considerable
prominence in recent years. The picture of
the thinking factors has important
implications for problem solving. We
find that there is no one factor that
can be called problem solving. This
is significant. Problem solving is
usually a complicated process. It is
clearly indicated that we should stop
looking for any one function or
process that is the sine qua non of all
problem solving. As the writer has
pointed out elsewhere, many factors,
including perceptual factors as well
as thinking factors, may be called
into play, depending upon the nature
of the problem (12).

In the list of thinking factors we
find one factor having to do with the
ability to recognize that a problem
exists and another factor that
pertains to the diagnosis of the problem.
The degree of generality of either
factor is still to be determined. So
far as we know now, either may be
restricted to a relatively narrow
category of problems. The next steps in
the attack on problem solving should
be to make a survey of the variety of
problems that are common and to
attempt to write specifications
regarding the factorial abilities that play
significant roles in the solution of
each type of problem. We should
then test these hypotheses by
experimental and factor-analytic
procedures.


At the beginning of the
aptitudes-project investigation of creativity it
was hypothesized that certain
special, creative factors would be found,
a few of them being then already
known, some not. The results have
supported most of the hypothesized
factors but not all (20, 31). Because
these factors were investigated within
the arbitrarily designated domain of
creativity, there has been a tendency
to think of them as being the
exclusive creative factors. This
conception is not fully correct. Creative
thinking, like problem solving (they
may actually overlap in many cases),
depends upon different combinations
of factors, and the combination of
factors significant to the task will
vary from time to time. The problem
confronting us here, as with problem
solving, is to recognize the main
categories of creative production and to
seek the significant combinations of
factors involved in them. Although
certain factors such as ideational
fluency and originality will carry
relatively more weight, other factors not
obviously creative may often be
significant, as when an invention
depends upon thinking by analogy or
upon visualization.

Thinking has many connections
with learning, and hence the thinking
factors are of some importance in
learning investigations and learning
theory. Thinking is sometimes
regarded as a form of learning, for while
we think we usually learn. Another
view of the connection is that
thinking contributes to learning. The
latter view is more productive of
approaches to investigation of the role
of factors in learning. It is not
enough to conclude that thinking
contributes to learning or even to
state and to test this as a general
hypothesis. The questions raised here
should be “Where and how does
factor X contribute to learning?” just
as it was asked in the preceding
paragraphs where and how each factor
contributes to problem solving and
creative activity. Since problem
solving and creative activity are
properly regarded as instances of
learning, we need only generalize the
question to make it apply to all
learning. Fleishman and Hempel (5, 6,
7) have already provided some
excellent demonstrations of the roles of
factors at different stages in the
learning process for certain
psychomotor tasks. This type of
investigation should be applied more
generally. Certainly we should have
outgrown the glib definition that
“intelligence is learning ability.”

The distinction between
associative and content-memory factors
reminds us that not enough attention
is generally paid to the same
distinction in studies of learning and
memory. Learning theory has restricted
itself almost entirely to the
formation and retention of associative
connections, leaving out of account the
learning of substance.

Speaking of learning suggests the
practical operation of education. At
some future time factors should have
much effect upon educational
practices, in addition to those effects
having to do with assessment. If
training and experience have much to do
with the development of the factors,
it is important to know the factors
and to determine the procedures
whereby their development can be
promoted by education.

There are many possible
relationships of the intellectual factors to
pathology. Defects of memory and
thinking are common occurrences in
connection with intellectual losses
that are associated with organic and
functional pathologies. If we find by
observation and by experimental
study that defects tend to be along
the lines of the intellectual factors,
we have another source of evidence
for the validity of the factors as
functional unities. In practice, the use of
measures of the factors may be
helpful in providing more accurate and
meaningful assessment of intellectual
losses. Losses described in terms of
the factor concepts may help in
understanding the types of pathology,
and in providing better definitions
and diagnostic criteria.


Intelligence and Intelligence Tests 


A treatment of the factors of
intellect would be incomplete without
considering their implications for the
concept of intelligence and for the
present and future of intelligence
testing. Is the concept of intelligence
still useful? What is the nature of
current intelligence tests in terms of
factors? What should the future
trends in intelligence testing be?

As to the general terminology, the
term “intellect” can be
meaningfully defined as the system of
thinking and memory factors, functions,
or processes. The term “intelligence”
has never been uniquely or
satisfactorily defined. Factor analysis
has fairly well demonstrated that it
is not a unique, unitary phenomenon.
A “general factor,” found by
whatever method, is not invariant from
one analysis to another and hence
fails to qualify as a unity,
independent of research circumstances, as
Vernon has well stated (29). The
methods of multiple-factor analysis,
which have been chiefly responsible
for discovering the factors listed
above, do not find a general
psychological factor at the first-order level
and they find no second-order factor
that can properly lay claim to the
title of “intelligence.”

The term “intelligence” is useful,
none the less. But it should be used
in a semipopular, technological sense.
It is convenient to have such a term,
even though it is one of the many
rather shifty concepts we have in
applied psychology. It would be very
desirable, for purposes of
communication and understanding, to specify
a number of intelligences:
intelligence A, intelligence B, and so on.
This could be done in terms of the
combinations of certain intellectual
factors and their weightings in the
combinations.

We have such combinations now in
connection with the intelligence tests
and scales in common use. Let us
consider what kind of combinations
we have in two of the most used
intelligence scales. A really good factor
analysis of the Stanford Revision of
the Binet scale would be rather
difficult, and cannot be done
satisfactorily without adding to the analyzed
battery a liberal number of reference
tests. This has never been done. The
best analyses that we have were done
by Jones (24, 25), who found ten
factors among 30 selected items. His
resulting picture is not clear because
among the 30 items were essentially
alternate forms of tests (at different
age levels) and no outside reference
tests were used. A fully satisfactory
analysis of the Stanford-Binet items
would undoubtedly reveal more than
ten factors present.

It should be noted that when so
many factors are present, a composite
score based upon all the items can
measure each component only to a
small degree, if they are nearly
equally weighted in the composite.
It can also be predicted that the
factorial composition of the Binet IQ
will be found to vary somewhat from
one age level to another. This feature
may contribute to a small extent to
obtained changes in IQ where
substantial age differences are involved.

As it actually happens, a
Stanford-Binet IQ, or any IQ from a test whose
components are predominantly
verbal, is a total score heavily dominated
by the verbal-comprehension factor.
This leaves the other factors with
little or no effective voice in the
composite, even though they are
represented in the scale. In nonverbal
intelligence tests, there is likely to be
less domination by any one factor,
but the nature of the composite
varies considerably from battery to
battery.
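
The point about composite scores can be made concrete with a small
numerical sketch. It rests on simplifying assumptions that are not
claims of this report: the component factors are taken to be
uncorrelated and of equal variance, and the weights are hypothetical.
Under those assumptions the correlation of a weighted composite with any
one component is that component's weight divided by the square root of
the sum of the squared weights.

```python
import math


def composite_correlations(weights):
    """Correlation of a weighted composite with each component.

    Assumes the components are uncorrelated and have unit variance,
    which is a simplification made only for this illustration.
    """
    scale = math.sqrt(sum(w * w for w in weights))
    return [w / scale for w in weights]


# Hypothetical composite of ten equally weighted factors.
equal_weights = [1.0] * 10

# Hypothetical composite in which one factor (for example, a verbal
# factor) carries three times the weight of each of nine others.
dominated_weights = [3.0] + [1.0] * 9

if __name__ == "__main__":
    equal = composite_correlations(equal_weights)
    dominated = composite_correlations(dominated_weights)
    # Squared correlations give each factor's share of composite variance.
    print("equal weighting, share per factor:", round(equal[0] ** 2, 3))
    print("dominant factor share:", round(dominated[0] ** 2, 3))
    print("share of each other factor:", round(dominated[1] ** 2, 3))
```

With ten equally weighted factors each accounts for one tenth of the
composite variance. When one factor is weighted three times as heavily
as each of the rest, it accounts for half of the variance and every
other factor for about one eighteenth, which is the sense in which the
other factors have little effective voice in such a composite.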

Analyses of the components of the
Wechsler-Bellevue scale have also
been generally inadequate. The most
adequate analysis has been done by
Davis (3), who utilized a number of
reference tests from outside the
Wechsler battery. He found nine
common factors, six of which are
probably to be identified with factors
in the intellectual list. Where
standard tests of intelligence are widely
used, it becomes increasingly
important to attempt to write the
specifications for their total scores as well
as their part scores, so that obtained
scores of individuals may be most
meaningfully interpreted.

Intelligence tests will probably
continue to be used for some time to
come much as they are. In order to
use them most wisely and to extract
the greatest amount of information
from their scores, the specification of
such scores in terms of known factors
is one important improvement that
could be made. The other great step
toward improvement in intelligence
testing would be to emphasize more
than at present some of the socially
important factors that have to do
with productive thinking. The
knowledge of the factors of this kind
and of the kinds of tests that
measure them is largely available. Only
by this kind of extension of
intelligence testing can we do adequate
justice to adult, human intellect.
Other extensions may also be very
useful, for we are a long way from
complete coverage of the intellectual
factors in present tests. For
differential prediction, and this includes
the operation of vocational guidance,
only single-factor scores will do
complete justice in the description of
individuals. As a necessary prelude
to the use of factor measures for such
purposes, we need innumerable
validation studies in which factors play
an important role, studies such as
those by Hills and others (23, 18).


SUMMARY 


A listing of the factors that can be 
regarded as intellectual was made,
including those reported in French's
summary of factors (8) appearing in 
1951 and those reported since that 
time. Of approximately 40 such fac- 
tors, seven are memory factors and 
the remaining ones have to do with 
thinking. 

An attempt was made to formulate 
a system into which the factors seem 
to fall. The thinking factors were 
categorized under the general head- 
ings of cognition (discovery), produc- 
tion (convergent thinking and diverg- 
ent thinking), and evaluation. The 
factors in each group can be arranged 
according to three kinds of content of 
thinking: figural, structural, and
conceptual. In the cognition and produc-

It is by now generally recognized
that all forms of psychotherapy yield
successful results with some patients
and that these successes depend to an
undetermined extent on factors
common to many types of relationship
between patient and therapist. This
poses a knotty problem for
proponents of various specific forms of
psychotherapy who are convinced that
their successes result from their
particular theory or technique and
wish to convince others of this. As a
result, problems of research design
in psychotherapy have been receiving
more and more critical attention in
recent years, especially with reference
to controls (6, 11, 20, 23, 24, 25, 27,
31, 34, 35, 38, 39).

Certain general aspects of the
psychotherapeutic relationship seem very
similar to those responsible for the
so-called placebo effect, which is well
known to investigators of the
therapeutic efficacy of medications. The
purpose of this paper is to describe
the placebo effect, discuss some of its
implications for the evaluation of
psychotherapy, and make some
recommendations concerning research
design in psychotherapy based on
these considerations.


THE PLACEBO EFFECT

We have now participated in two
separate investigations of the
effectiveness of drugs on the symptomatic
distress of psychiatric outpatients
(14, 22). Both studies involved the
administration of a placebo, an inert
agent outwardly indistinguishable
from the agent being tested, as well
as drugs. The physician never knew
whether he was giving the patient
drug or placebo. The patients were
told that a new medicine had become
available which, it was thought,
might help them. The physicians
rated symptoms on a 4-point scale of
distress, with high reliability. In
both studies a significant reduction of
distress accompanied the taking of
placebos, as shown in Table 1.

This phenomenon occurs with great
regularity, not only with respect to
the kinds of symptoms usually
associated with psychologic illness, but
with others as well. For example, in
a study of vaccines for the common
cold, there was found a reduction in
the number of yearly colds of 55 per
cent among those given vaccine and
of 61 per cent among a control group
who received injections of isotonic
sodium chloride solution (4). Hillis
(15) found placebos as effective as
other agents in inhibiting the cough
reflex. Wolf and Pinsky (37) studied
medical outpatients suffering from
peptic ulcer, migraine, muscle
tension, headache, and tight muscles in
the extremities. All were also tense
and anxious. Twenty to thirty per
cent felt better while taking placebos.
Lasagna et al. (19) gave 1 ml. of
saline by subcutaneous injection to
surgical patients suffering from steady,
severe wound pains and found that
30 to 40 per cent reported a
satisfactory relief of pain. In a study by
Jellinek (18) 60 per cent of 199
subjects with chronic headaches received
relief from a placebo on one or more
occasions.


TABLE 1


SYMPTOM DISTRESS BEFORE EXPERIMENTS AND AFTER A TRIAL ON PLACEBOS

                                Mean distress scores
                                Before       After     Significance
Study       Drug tested    N    experiment   placebo   of difference
1st study   Mephenesin     17   25.58        15.88     .001
2nd study   Reserpine      16   34.06        24.69     .02


The placebo effect is not always
favorable, but may also result in
undesirable, distressful reactions. As
far back as 1933, Diehl (3) using
lactose placebos as a control for a variety
of medications taken by mouth,
found that some of his subjects
receiving placebos developed nausea,
faintness, and diarrhea. Sometimes
this “toxic response” to placebos may
even attain major proportions. Wolf
and Pinsky (37) tell of one patient
who had “overwhelming weakness,
palpitation, and nausea within 15
minutes of taking her tablets.” In
another, “a diffuse itchy
erythematous maculopapular rash developed
after ten days of taking pills. A skin
consultant considered the eruption to
be typical dermatitis medicamentosa.
After use of the pills was stopped, the
eruption quickly cleared.” A third
patient developed epigastric pain
followed by watery diarrhea, urticaria,
and angioneurotic edema of the lips
within ten minutes of taking her pills.
One of our own patients, who had
been tolerating a chronic
syphilophobia fairly well, became acutely
agitated shortly after placebo
ingestion, bemoaning what the pills had
done to him, and required
hospitalization shortly thereafter.

Wolf and Pinsky (37) found that
placebos produced more
improvement in subjective than objective
manifestations of anxiety and
tension, but objective changes also
occur. In our second study (22), 69
per cent of our patients showed
decreased blood pressure and pulse
readings following placebo, 19 per
cent showed increased blood
pressure, and 25 per cent showed a rise in
pulse rate. Wolf (36) demonstrated
clearly and convincingly that actual
end-organ changes can follow placebo
administration. This demonstration
was made in a series of studies on the
now-celebrated Tom, a human
subject with a large gastric fistula, in
whom it was possible to observe
directly the gastric mucous membrane,
correlating changes in color and
turgidity with simultaneous
measurements of gastric secretion and motor
activity.

The placebo effect may actually
reverse the normal pharmacologic
action of a drug. For example, Wolf
reports that Tom was repeatedly
given Prostigmine, which induced
abdominal cramps, diarrhea, as well
as hyperaemia, hypersecretion, and
hypermotility of the stomach.
Subsequently, the same response
occurred not only to tap water and
lactose capsules, but also to atropine
sulfate which usually has an
inhibiting effect on gastric function. A
pregnant patient with excessive
vomiting showed the usual response of
nausea and vomiting to ipecac. These
manifestations were accompanied by
cessation of normal gastric
contractions. When ipecac was given
through a tube with strong assurance
that it would relieve her vomiting,
gastric contractions were resumed at
the same interval after ingestion of
the drug that they would normally
have ceased, and her nausea and
vomiting were relieved.

The placebo effect, in short, can
be quite powerful. It can
significantly modify the patient's
physiological functioning, even to the
extent of reversing the normal
pharmacological action of drugs; and, as will
be discussed below, it may be
enduring. Placebo effects cannot be
dismissed as superficial or transient.
They often involve an increased sense
of well-being in the patient and are
manifested primarily by relief from
the particular symptomatic distress
for which the patient expects and
receives treatment. Thus, the relief
of any particular complaint by a
given medication is not sufficient
evidence for the specific effect of the
medicine on this complaint unless it
can be shown that the relief is not
obtained as a placebo effect.


IMPLICATIONS OF THE PLACEBO
EFFECT FOR RESEARCH IN
PSYCHOTHERAPY


The giving of any medication may
have certain meanings for a patient
in terms of his relationship to his
physician which may benefit his
condition irrespective of the
pharmacological action of the drug. For
example, it may relieve the anxiety
resulting from the distress caused by
his illness (10). Wolf believes the
effects of placebos on his patients
“depended for their force on the
conviction of the patient that this or that
effect would result.” The degree of
the patient's conviction might be
expected to be influenced by his
previous experiences with doctors, his
confidence in his physician, his
suggestibility, the suggestibility-enhancing
aspects of the situation in which
the therapeutic agent is being
administered, and his faith in or fear of
the therapeutic agent itself. These
attitudes are obviously relevant to
psychotherapy.

Psychotherapists have theories of
personality and psychotherapy and
plan their therapeutic actions in the
belief that these are the active agents
which produce the desired results.
Any favorable changes in patients
consequent to a course of
psychotherapy tend to be cited as evidence for
the validity of the theory of
personality and neurosis which underlie the
rationale of the psychotherapy. In
view of the above discussion it may
well be that the efficacy of any
particular set of therapeutic operations
lies in their analogy to a placebo in
that they enhance the therapist's
and patient's conviction that
something useful is being done. Patients
entering psychotherapy have various


and 
this may be an important factor in 
the results of therapy, but this has 
not been studied, to our knowledge. 
We know that the authoritarian at- 
titude of the physician can produce 
this conviction in some patients. 
At first glance the attitudes found 
Fiedler (8, 9) to characterize ex- 
perienced psvchotherapists, viz. feel- 
ings of empathy for and closeness to 
the patient, an undemanding atti- 
tude, security, and the ability to “‘un- 
derstand” the patient, dia 
metrically opposed to the authoritar- 
ian attitude. It may be, however, 
that the therapeutic efficacy of these 
attitudes lies primarily in their abil- 
itv to increase the confidence of cer- 
tain patients in the ability of the 
therapist to help them. Lack of such 
confidence may be one of the reasons 
why patients of lower socioeconomic 
status fare less well in psychotherapy 
than patients higher in this scale (16, 
29), a talking therapy seeming to be 
beyond their comprehension and con- 
trary to their conception of the doc- 
tor-patient relationship. 


degrees of belief in its efficacy, 


bv 


seem 
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In this connection, the role of sug- 
gestion in psychotherapy has been 
emphasized for vears, especially in 
therapies utilizing hypnosis, but sug- 
gestion effects have thought 
by many since Freud to be superficial 
and transitory. We know of no ex- 
perimental study which demonstrates 
that therapeutic effects based on in- 
sights or perceptual reorganization, 
which may also be suggested, are less 
superficial or less transitory. 

It may be pointed out parentheti- 
cally that conviction of the helpful- 
ness of therapy need not be equated 
with “motivation for therapy,’ which 
Was investigated by Grummon (13 
and Dymond (5) and found to have 
little relationship to success in psy- 
chotherapy. 
ficiently strongly 
motivated to receive help, vet have 
little faith that a procedure such as 
psychotherapy can help them. 

The similarity of the forces operat- 
ing in psvchotherapy and the placebo 
effect may account for the high con- 
sistency of improvement rates found 
with from. that 
conducted by physicians without psv- 


been 


Patients are often sut- 
distressed to be 


Various 


therapies, 


chiatric training to intensive psvcho- 
analvsis (7 This explanation gains 
plausibility from the fact that re- 
ported improvement rates for vari- 
ous series of neurotics treated by dif- 
ferent forms of psychotherapy hover 
around 60 per cent (1). This is the 
same as that reported for the placebo 
effect in illnesses in which emotional 
components may play a major role 
such as “colds” (3) and headaches 
(18). 

To show that a specific form of 
treatment produces more than a non- 
placebo effect it must be 
shown that its effects are stronger, 
last longer, or are qualitatively dif- 
ferent from those produced by the 
administration of placebos, or that it 
affects different patients. 


specific 


types of 
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Our knowledge of all these matters 
is still fragmentary, but some begin- 
nings have been made. 

With respect to the strength and qualitative nature of the effects of therapy, one line of endeavor has been to study the physiological changes occurring during psychotherapy. Since physiological measures usually used to provide evidence of resistance or frustration (26, 33) or similar psychological states during psychotherapy (28) may also be influenced by the placebo effect, one cannot conclude that demonstration of such physiological changes implies a greater depth of therapy or a more profound reorganization of the personality, unless we are willing to equate the placebo effect with such reorganization.

With respect to the duration of improvement, if it could be shown that the placebo effect is of shorter duration than changes specific to a given psychotherapy, this would provide one kind of evidence favoring that theory of psychotherapy. As far as we know, no study of the limits of duration of the placebo effect has been made. Our experiment with mephenesin vs. placebo covered four two-week periods. Figure 1 shows the curves for both agents for the eight weeks.

Figure 1 shows that the greatest decrease in distress following placebos was felt during the first two-week trial period. After that, a slight but statistically insignificant rise in distress occurred; and, at the end of eight weeks, the placebo effect was about as great as after two weeks. Unfortunately, our data yielded no information on how much longer it might have endured. If the effect is analogous to the relief of pain by placebos in patients with surgical wounds, we should expect it eventually to diminish. Lasagna et al. (19) found that as placebo therapy of such patients continued the relief experienced decreased.

FIG. 1. EFFECTS OF MEPHENESIN AND PLACEBO ON SYMPTOMATIC DISTRESS OVER AN 8-WEEK PERIOD

Total patients = 17. At the 2-, 4-, 6-, and 8-week intervals, N for placebo = 11, 6, 10, and 7 respectively, while N for mephenesin = 6, 11, 7, and 10 respectively. For the 2- and 4-week periods, the dosage of mephenesin was 3 gms. per day; for the 6- and 8-week periods, 9 gms. per day.

Although the number of patients is too small to justify any conclusions, it is intriguing that the first dose of mephenesin seemed to counteract the placebo effect. In the study with reserpine (22), the only patients who failed to show a placebo effect were those who had received reserpine previously. It may be that any discomfort produced by a pharmacologically active agent tends to counteract the emotional state responsible for a placebo effect in susceptible patients. Analogously, an activity by the psychotherapist which disturbs the patient may conceivably counteract the placebo effect of psychotherapy with certain patients.

It would also be helpful to know if patients could be differentiated according to attributes which predisposed them to a positive or negative placebo effect. If patients who improved with a particular form of psychotherapy were all known to be positive placebo reactors, then the improvement could not be attributed to the specific form of treatment. If, however, they were known not to be positive placebo reactors, then any demonstrated improvement would constitute evidence of efficacy specific to the form of psychotherapy.

There is little known, however, with regard to the attributes of placebo reactors. Lasagna et al. (19) have made the first attempts to investigate this problem and report some attitudes and Rorschach categories which differentiated their reactors (N=11) from their nonreactors (N=16). However, only 14 per cent of their patients were consistent reactors, i.e., showed the effect with every placebo dose, and 31 per cent were consistent nonreactors, while 55 per cent showed the effect on some occasions but not on others. This contrasts with the findings of Jellineck (18) whose patients with headache were, for the most part, either in the always-relieved group or the never-relieved group, with only a small percentage of patients showing inconsistency of response. The apparent contradiction in findings may perhaps result from the difference in the cause of the pain in the two series or from other factors. In any case it indicates that the problem is a complex one needing much more study.

In the light of these considerations, any method of demonstrating the specificity of response to a given type of psychotherapy would have to provide an adequate control design. As far as we know, the study which has paid closest attention to the question of controls in research in psychotherapy is that of Rogers and his colleagues (31). They employed two different kinds of control groups. One was a group of nonclients who were simply given a battery of tests before and after specified time periods. The other was a group of clients who were required to wait a specified period of time before beginning therapy. This group was tested at the beginning and end of the wait period, at the end of therapy, and after a follow-up period.

These procedures do not control for the placebo effect since neither control group was being subjected to any special procedures which could produce a reasonable expectancy in control subjects that certain changes should occur. The experimental group, however, could be expected to anticipate certain effects merely as a consequence of participating in the client-therapist interviews. Therefore, even though favorable changes could be demonstrated in their clients, the question of whether these were placebo effects could not be answered from such research design unless additional information were provided.

If we do not control for nonspecific factors like the placebo effect, we cannot know whether effects predicted from a theory lead to or result from improvement based on the nonspecific effect. Butler and Haigh (2), for example, report an increased correlation of perceived self with ideal self following client-centered therapy. The implicit inference is that the specific therapeutic method leads to increased correlation which, in turn, contributes to amelioration of disability and distress.

It is conceivable, though, that as a result of a nonspecific placebo effect the client feels less disabled and distressed which, in turn, leads him to describe himself as more like his ideal self. Rogers' (30) findings of greater emotional maturity in successfully treated cases may be similarly explained, clients feeling less disabled and distressed due to a nonspecific placebo response and behaving consequently in ways which are less anxiety-determined and which are seen as more mature by others.

We would propose that the follow- 
ing conditions are optimal in planning 
research in psychotherapy: 

1. A theory of personality and psychological distress (neurosis, maladjustment, etc.).

2. Predictions of effects in the pa- 
tient or client consequent to psycho- 
therapy, in accord with the theory. 

3. Demonstration of a relationship 
between the predicted effects and 
some criterion of improvement. 

4. Demonstration that the predicted effects and their relationship to the improvement criterion are not due primarily to the patient's conviction that therapy will help him. This will permit greater confidence that the relationship found is specific to the therapeutic technique derived from the theory.

Ideally, these conditions should obtain both for process and outcome research. There seems to be general agreement with regard to the first two conditions although Mackinnon (21) has some reservations about beginning with a theory rather than a hunch. Gordon et al. (12) have come to question the third condition, at least with respect to a "global" criterion of improvement.

The fourth condition has not been met in any research of which we are aware. It is not possible to set up an experiment precisely analogous to comparison of a medication with a placebo because there is no such thing as inert psychotherapy in the sense that placebos are pharmacologically inert. However, it may be possible to study the possible specific effects of any particular form of therapy by the use of a matched control group participating in an activity regarded as therapeutically inert from the standpoint of the theory of the therapy being studied. That is, it would not be expected to produce the effects predicted by the theory. The "placebo psychotherapy" in this sense would be analogous to placebos in that it would be administered under circumstances and by persons such that the patients would expect to be helped by it.

Let us say that our theory is psychoanalytic and our predicted effect is an increased correlation between the moral values of the patient and the therapist (superego identification) and that we also expect an association between the increased correlation and a criterion of improvement (32). According to the theory, there is no reason to believe that control patients receiving, for example, relaxation therapy (17) will show the increased correlation of moral values with their therapist's moral values, nor should they show as much or as lasting improvement as the patients receiving psychoanalytic therapy of equal length. Such a design would constitute a fair test of the hypothesis based on the theory. In comparative studies where one type of psychotherapy is tested against another, differences found between them in predicted effects or amount, nature, and duration of improvement would not be explainable as placebo effects, if the condition could be met that patients had equal faith in the efficacy of the therapies and therapists to which they are assigned.


SUMMARY AND CONCLUSIONS 


The literature on the therapeutic efficacy of drugs compared with placebos is briefly reviewed, and its relevance for research in psychotherapy considered. It is concluded that improvement under a special form of psychotherapy cannot be taken as evidence for: (a) the correctness of the theory on which it is based; or (b) efficacy of the specific technique used, unless improvement can be shown to be greater than or qualitatively different from that produced by the patients' faith in the efficacy of the therapist and his technique: "the placebo effect." This effect may be thought of as a nonspecific form of psychotherapy and it may be quite powerful in that it may produce end-organ changes and relief from distress of considerable duration.

To show that a specific form of psychotherapy based on a theory of personality and neurosis produces results not attributable to the nonspecific placebo effect it is not sufficient to compare its results with changes in patients receiving no treatment. The only adequate control would be another form of therapy in which patients had equal faith, so that the placebo effect operated equally in both, but which would not be expected by the theory of therapy being studied to produce the same effects. We need to learn more about the nature of the placebo effect, the conditions giving rise to it, and the attributes of patients most susceptible or resistant to it so that we may obtain a better understanding of the role of nonspecific factors in psychotherapy.


In recent years a number of studies involving human Ss have been devoted to testing the implications of certain Hullian notions concerning the relationship between performance in learning situations and level of total effective drive (D). In these investigations drive has been defined in terms of scores on a manifest anxiety scale (41). In view of the growing experimental literature concerning these hypotheses since their initial statement by Taylor (40) and Taylor and Spence (43), an attempt to outline the theory as it is presently conceived by the Iowa group and to evaluate the evidence concerning it seems to be in order.

Before proceeding with these matters, however, certain misunderstandings which have arisen concerning the purpose of this work should be mentioned. First, although groups have been selected exclusively on the basis of scores on the Manifest Anxiety Scale (hereafter designated as MAS) the interest of the Iowa group has not been in investigating anxiety as a phenomenon, but rather in the role of drive in certain learning situations. The assumption has been made that anxiety scores are related in some manner to drive level, but in terms of the major theoretical interests of this group, any other acceptable specification of drive (e.g., hunger) could be used in experimental tests of the hypotheses about the effect of drive level. Further, as Farber (6) has pointed out, no attempt has ever been made to claim that the only difference between individuals receiving different scores on the MAS is in drive level or that all performance differences could be explained by drive. Undoubtedly there are many characteristics other than drive level on which anxious and nonanxious Ss differ; the investigation of these additional properties of anxiety groups and their influence on performance is certainly both legitimate and important, but it simply has not been the interest of the proponents of the drive theory.

A second point that should be clarified has to do with the MAS. The construction of the test was not aimed at developing a clinically useful test which would diagnose anxiety, but rather was designed solely to select Ss differing in general drive level. Thus the question of the "validity" (i.e., its agreement with clinical judgments) is in a sense irrelevant to the experimental purposes for which the test was developed. In light of this, the test might better have been given a more noncommittal label, such as a measure of emotionality, although the fact that the items on the scale were selected by clinicians as referring to manifest anxiety as it is described psychiatrically does not make the title completely inappropriate nor a relationship between clinical judgments and MAS scores unexpected. Certainly the generality of the experimental findings with the MAS would be increased if correlations were found with other definitions and such attempts will be discussed in a later section. However, regardless of the results of such studies, it should be clearly understood that "manifest anxiety" has been defined operationally only in terms of test scores and will be so employed, unless otherwise indicated, in the present paper.


DRIVE THEORY

As stated earlier, the purpose of the Iowa group has been to investigate the effects of varying drive level on performance in learning situations. Actual experimentation has involved two independent problems: (a) specification of the conditions under which drive differences are said to appear, and (b) the theory concerning the effects of drive level on behavior once drive has been aroused. The first problem concerns the postulated relationship between the MAS and drive level, the second between drive (or anxiety) level and performance in various situations. Since the two are separate matters, an outline of the theory concerning the influence of drive will be given first and the hypothesized relationship between drive and MAS scores considered at a later point.

According to Hull (15), all habits (H) activated in a given situation combine multiplicatively with the total effective drive state (D) operating at the moment to form excitatory potential E [$E = f(H \times D)$]. Total effective drive, in the Hullian system, is determined by the summation of all extant need states, primary and secondary, irrespective of their source and their relevancy to the type of reinforcement employed. Since response strength is determined in part by E, the implication of varying drive level in any situation in which a single habit is evoked is clear: the higher the drive, the greater the value of E and hence of response strength. Thus in simple noncompetitional experimental arrangements involving only a single habit tendency the performance level of high-drive Ss should be greater than that for low-drive groups.

Higher drive levels should not, however, always lead to superior performance (i.e., greater probability of the appearance of the correct response). In situations in which a number of competing response tendencies are evoked, only one of which is correct, the relative performance of high and low drive groups will depend upon the number and comparative strengths of the various response tendencies. Predictions concerning the performance of the groups in such complex tasks involve the introduction of additional Hullian concepts: oscillatory inhibition (O) and threshold (L).

The concept of O was introduced by Hull (15) in an attempt to allow for statement, within his system, of the intra-individual variability in behavior that occurs, presumably, because of uncontrolled variation from instant to instant within the organism and in his environment. The value of O is said to vary from moment to moment, the distribution of O values for a group of (like) individuals on any trial forming a normal probability function. O is further assumed to play an inhibitory role, its value being subtracted from excitatory potential (E), thus yielding momentary excitatory potential ($\bar{E}$). In order for $\bar{E}$ to activate a response, it must attain a minimum or threshold value (L), a value that is presumably the same for all similar habit tendencies evoked in a given situation. Thus $R = f(\bar{E}) = f(E - O - L)$.

In any task in which a stimulus tends to evoke a number of competing responses the response that will appear on a given occasion will be the one with the highest suprathreshold momentary excitatory strength ($\bar{E}$) at that moment. Other things being equal, of course, the response with the greatest H and hence E value will have a greater probability of occurring than any other response.
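
The selection rule just described can be sketched in a few lines of Python. This is an illustrative sketch only, not part of the theory as stated: it assumes that f is the identity function, that O is drawn independently for each tendency from a normal distribution with mean zero, and that all numerical values are arbitrary. The function name `select_response` and its parameters are hypothetical.

```python
import random


def select_response(habits, drive, threshold, o_sd, rng):
    """Return the index of the response that appears, or None if none does.

    habits    -- habit strengths H, one per competing response tendency
    drive     -- total effective drive D
    threshold -- threshold L, assumed equal for all tendencies
    o_sd      -- standard deviation of oscillatory inhibition O (assumed)
    rng       -- a random.Random instance supplying the momentary O values
    """
    best_index, best_margin = None, 0.0
    for index, h in enumerate(habits):
        e = h * drive                 # E = H x D, with f taken as identity
        o = rng.gauss(0.0, o_sd)      # momentary O, normal by assumption
        margin = e - o - threshold    # E - O - L
        if margin > best_margin:      # must be suprathreshold and the largest
            best_index, best_margin = index, margin
    return best_index
```

With `o_sd` set to zero the rule is deterministic: the tendency with the greatest H always appears, and a tendency too weak to exceed L at one drive level can be brought over the threshold by raising D.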

Adding the notion of differing drive level to this conception, we see that the probability of appearance of the correct response involves an interaction between drive level and the number and comparative strengths of the correct and incorrect tendencies. When the correct response is weaker (i.e., has less H) than one or more of the competing response tendencies, high-drive groups should be inferior in performance to low-drive Ss. That is, because of the multiplicative relationship between habit strength and drive, the stronger incorrect tendencies gain relatively more E than the correct tendency in the case of high drive Ss than in low drive, thus leading to a greater probability of occurrence of one of the stronger incorrect responses in the high-drive group. Further, the possibility exists that under a high-drive level new competing responses with very weak habit strengths may be brought over the threshold value of E with the consequence that the probability of occurrence of the correct response is lowered relative to that in a low-drive condition.

At the other extreme, the correct response tendency may be highest in the hierarchy and relatively strong when compared to the incorrect. In such a situation, which is comparable to the case in which but a single habit is aroused, the E value for the correct response would be relatively greater than the other responses in the hierarchy for the high-drive group than for the low-drive, leading to the prediction of the superiority of performance of such subjects.
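
The two extremes described above can be illustrated with a small simulation. The sketch below is hedged in several ways: the function `p_correct`, the habit strengths, drive levels, threshold, and spread of O are all illustrative assumptions, f is taken as the identity, and O is assumed normal with mean zero. It is meant to show the direction of the predicted interaction, not to reproduce any reported data.

```python
import random


def p_correct(habits, correct, drive, threshold=0.1, o_sd=0.2,
              trials=20000, seed=0):
    """Estimate the probability that the correct response appears."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(trials):
        # Momentary suprathreshold strength of each tendency: H*D - O - L.
        margins = [h * drive - rng.gauss(0.0, o_sd) - threshold
                   for h in habits]
        winner = max(range(len(habits)), key=margins.__getitem__)
        # A response appears only if the strongest tendency exceeds L.
        if margins[winner] > 0 and winner == correct:
            hits += 1
    return hits / trials


# Correct tendency (index 0) weaker than its competitor.
weak_low = p_correct([0.3, 0.6], correct=0, drive=1.0)
weak_high = p_correct([0.3, 0.6], correct=0, drive=2.0)

# Correct tendency (index 0) dominant in the hierarchy.
dom_low = p_correct([0.6, 0.3], correct=0, drive=1.0)
dom_high = p_correct([0.6, 0.3], correct=0, drive=2.0)

print(f"correct weaker:   low drive {weak_low:.3f}, high drive {weak_high:.3f}")
print(f"correct dominant: low drive {dom_low:.3f}, high drive {dom_high:.3f}")
```

Under these assumed values, doubling D widens the gap in E between the two tendencies, so the simulated high-drive condition does worse when the correct tendency is the weaker one and better when it is dominant.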

It should be obvious, then, that maximum inferiority of high-drive Ss would be expected when a large number of competing tendencies are present and the correct tendency is both relatively weak and low in the hierarchy. As the strength of the correct tendency increases relative to the incorrect, high-drive groups should become less inferior and eventually superior in performance to low-drive groups. The exact point of equality would be difficult to specify. Even when the correct response is highest (though not strongly dominant) in the hierarchy, high-drive Ss could still conceivably be inferior in some instances since a greater number of suprathreshold tendencies could more than offset the advantage of the relatively higher E value of the correct response for these individuals.¹

An important consideration that should be noted about making predictions concerning the effect of drive level upon performance in actual experimental situations is that a behavioral analysis of the situation must have been made; only in experimental arrangements in which the results, independent of drive level, permit statements in terms of competing S-R tendencies are deductions from the theory possible. While the majority of investigations designed to test implications of these derivations concerning drive level have utilized tasks for which analyses in S-R terms had already been made and found to be useful, occasionally an experiment appears in which the investigator attempts to evaluate the total theory by comparing groups on a task which is poorly understood (and for which little or no rationale is presented) or which clearly involves the introduction of variables not included in the theory. The accumulation of empirical evidence concerning the performance of different groups in any situation or attempts to incorporate additional variables within any theoretical framework are certainly to be encouraged, but statements that the results of such studies refute or confirm theoretical expectations are unwarranted when there is no evidence that the boundary conditions imposed by the theory are met.

¹ In a recent review Child (3) incorrectly interpreted the theoretical analysis outlined above as involving the sudden introduction of O and L for the situation in which the correct response is highest in the hierarchy. These concepts are of course assumed to be operating in all situations, including the noncompetitional one in which but a single response tendency is being evoked. No appeal was made to these constructs in the latter instance, however, since their inclusion would not affect the predictions. Mention might also be made of other constructs in the Hullian system (e.g., J, V, K, etc.): it has been assumed that these are of equal value for all drive groups and that a consideration of their values would not result in changing any prediction.


DRIVE AND ANXIETY 


The use of the MAS to select groups that are postulated to differ in drive level in an experimental situation has rested on the assumption that scores on the scale are in some manner related to emotional responsiveness, which, in turn, contributes to drive level. Two alternative hypotheses have been entertained concerning the conditions under which emotionality is evoked. One is that test scores reflect differences in a chronic emotional state so that individuals scoring high on the scale tend to bring a higher level of emotionality or anxiety "in the door" with them than do Ss scoring at lower levels (40). A second alternative conception is that MAS scores reflect different potentialities for anxiety arousal, high scoring Ss being those who tend to react more emotionally and adapt less readily to novel or threatening situations than do low scorers (28, 37). According to the first hypothesis differences among anxious and nonanxious groups (providing other conditions imposed by the theory are met) should be found whether or not there is any "threat," in the form of noxious stimulation, fear of failure or the like, in the situation. Thus, for example, the performance of anxious Ss should be superior to the nonanxious in both classical defense conditioning, in which a noxious stimulus is employed, and in reward conditioning into which no objective threat has been introduced. In the case of the second conception, differences would be expected in the performance of anxiety groups only in those situations in which some threat is present. Should this be the correct conception, exact specification of the conditions thought to be sufficient to evoke anxiety would be necessary in order to test hypotheses concerning the role of drive. Available evidence suggests that the magnitude of differences among groups may be related to the level of noxious stimulation employed (37), or to stress-producing instructions (10, 19), suggesting that differences in drive level among groups may depend at least in part upon situational factors. However, the picture is complicated by the results of a number of studies in which differences among anxiety groups have been found in the absence of noxious stimulation or instructions designed to produce stress (8, 24, 25, 26, 42).

Most investigators have not explicitly considered this issue, assuming either that anxiety scores reflect a chronic level of emotionality or that factors are present in the typical laboratory experiment that result in different anxiety levels among groups. For purposes of evaluating those studies in which degree of stress has not been under investigation, the assumption will tentatively be made here that in all situations, individuals scoring high and low on the anxiety scale will differ in drive level, for whatever reason. The evidence more directly concerned with the conditions of anxiety-arousal will be considered at a later point.


EXPERIMENTAL EVIDENCE 


Classical conditioning. Classical
conditioning is said to be a
noncompetitional situation in which but a
single response tendency is being
acquired; theoretical expectation
therefore is that anxious groups will
perform at a higher level than
nonanxious. The results of a number of
studies of eyelid conditioning using
groups with extreme scores on the
MAS² have upheld these predictions,
anxious Ss showing a greater number
of CR’s than nonanxious (11, 35, 37,
38, 39, 40). In all cases but one (11)
these differences were statistically
significant, the exception involving the
use of only 10 Ss per group, considerably
fewer than were employed in
other investigations. Data from eyelid
conditioning studies performed in
the Iowa laboratories and elsewhere
(39) are also available from Ss scoring
throughout the entire range of
anxiety scores rather than only at the
two extremes. The relationship between
anxiety and conditioning scores
has been uniformly found to be
monotonic although not always linear,
middle-anxiety Ss tending to show a
performance level closer to the
low-scoring than the high-scoring groups.
The magnitudes of the correlation
coefficients obtained have been in the
neighborhood of .25, thus indicating
that relatively little of the variance
among Ss can be accounted for in
terms of anxiety scores. In view of
the low correlation and the monotonic
relationship between the two
variables, continued use of extreme
groups only for research purposes in
such situations seems justified.

² In almost all of the studies involving the
MAS, a comparison has been made of extreme
scorers, typically the 20th percentile or below
(nonanxious) and 80th percentile or above
(anxious) in terms of a standardization group
of college students (41). Use of the terms
“anxious” and “nonanxious” groups here
should be understood to refer to such
extremes unless otherwise indicated.
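
The step from a correlation in the neighborhood of .25 to "relatively little of the variance" can be made explicit by squaring the coefficient. The short sketch below is illustrative only; the function name is an assumption of this example and does not come from any of the studies cited.

```python
def variance_accounted_for(r: float) -> float:
    """Proportion of variance linearly associated with the predictor (r squared)."""
    if not -1.0 <= r <= 1.0:
        raise ValueError("a correlation coefficient must lie in [-1, 1]")
    return r * r


# .25 is the approximate correlation reported in the text.
share = variance_accounted_for(0.25)
print(f"{share:.4f}")  # 0.0625
```

On this reckoning roughly 6 per cent of the variance in conditioning scores is associated with anxiety scores, which is the sense in which the relationship is described as weak.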

A conditioning study employing a
response other than the eyeblink has
also been reported in the literature.
An investigation by Bitterman and
Holtzman (1) utilized the PGR
technique which, like the eyelid situation,
it will be noted, involves defense
conditioning. After dividing a group of
randomly selected college students
into the upper and lower 50% on the
basis of MAS scores, these investigators
found a slight but statistically
insignificant superiority in conditioning
level on the part of their anxious
Ss. Since their anxious group
included individuals with scores
considerably lower than those in the
investigations referred to above, this
lack of statistical significance is not
too surprising.

Several studies are available con- 
cerning differential conditioning, also 
in the eyelid situation (11, 34, 36). 
The predictions derived from the 
theory in this instance are that anx- 
ious Ss should exhibit a greater excita- 
tory strength both to the positive 
(reinforced) CS and to the negative 
(nonreinforced) CS and further, that 
the difference in excitatory strengths 
of the two stimuli should be greater 
for the anxious group. By transform- 
ing all raw data into excitatory 
strength values, Spence and his col- 
leagues (34, 36) have attempted to 
test these predictions in some five
separate instances. In each case, the
excitatory strength to the positive CS
during differential conditioning was
significantly greater for anxious Ss,
as was expected. The results
concerning the remaining two predictions
were not so clear-cut. In four out of
five independent instances the
excitatory strength to the negative stimulus
was greater for the anxious Ss but in
no case was the difference significant.
In all five cases the difference
between excitatory strengths was in the
expected direction but was significant
in only one instance. While the
results of these studies tend to lend
some support to the theory, somewhat
contradictory findings have
been reported by Hilgard, Jones, and
Kaplan (11). As mentioned earlier,
contrary to other studies of simple
eyelid conditioning, these investigators
found only a slight, statistically
insignificant superiority for anxious
Ss during training to the positive CS.
During differential conditioning, the
anxious group continued to exhibit
an insignificant superiority to the
nonanxious on the positive CS. However,
the responses of the anxious Ss
to the negative CS were significantly
greater as would be expected by drive
theory.
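
The three predictions for differential conditioning follow from the multiplicative assumption of Hull-Spence drive theory, in which excitatory strength is the product of habit strength and drive (E = H × D). The sketch below shows this under stated assumptions: the habit and drive values are hypothetical, chosen only for illustration, and none are taken from the studies reviewed.

```python
def excitatory_strength(habit: float, drive: float) -> float:
    """Multiplicative rule assumed by drive theory: E = H * D."""
    return habit * drive


# Hypothetical values, for illustration only.
H_POSITIVE = 0.8  # habit strength to the reinforced (positive) CS
H_NEGATIVE = 0.3  # generalized habit strength to the nonreinforced (negative) CS
DRIVE = {"nonanxious": 1.0, "anxious": 1.5}


def predictions(drive: float) -> dict:
    """Excitatory strength to each CS, and their difference, at one drive level."""
    e_pos = excitatory_strength(H_POSITIVE, drive)
    e_neg = excitatory_strength(H_NEGATIVE, drive)
    return {"E_pos": e_pos, "E_neg": e_neg, "difference": e_pos - e_neg}


low = predictions(DRIVE["nonanxious"])
high = predictions(DRIVE["anxious"])
```

Because drive multiplies both habit strengths, raising it increases the excitatory strength to each CS and also widens the gap between them, which is why all three predictions are made for the anxious group.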

Stimulus generalization. Stimulus 
generalization, to which differential 
conditioning is related, has been in- 
vestigated more directly by Rosen- 
baum (28) and Wenar (48). Rosen- 
baum found greater responsiveness to 
generalized stimuli in a spatial situa- 
tion for an anxious group than for a 
nonanxious group, as would be pre- 
dicted by drive theory, but only in 
the case of Ss given strong intermittent
shock during their performance;
for groups given a weak shock or
buzzer, no significant differences
emerged. After training groups of
anxious and nonanxious Ss on a
key-pressing response to a strong shock,
weak shock or a buzzer presented at
regular intervals, Wenar (48) measured
the reaction time to these stimuli
in a test series in which the intervals
of presentation were longer or
shorter (temporal generalization)
than those employed during training.
Reaction time was related significantly
to both stimulus intensity and
anxiety level, response time being
quicker as these variables increased.

Maze learning. The first study to 
be concerned with demonstrating 
that the relative performance of anx- 
ious and nonanxious Ss is a function 
of degree of interference within a 
task was reported by Taylor and 
Spence (43), who used a type of serial 
verbal maze. On the assumption 
that errors in such a situation are 
largely the result of interfering
response tendencies, due to remote
associations, it was expected that
anxious Ss would make more errors 
and take more trials to reach a cri- 
terion than nonanxious. The results 
of this study and of a subsequent in- 
vestigation by Farber and Spence 
(8) with a stylus maze have confirmed
these hypotheses, the greater number 
of errors and trials to criterion being 
made by the anxious groups. An ad- 
ditional prediction was also made for 
these maze data, namely that the de- 
gree of inferiority of the anxious Ss 
in comparison to the nonanxious 
should be positively related to
difficulty of the choice point. In both
studies, significant rank-order
correlations were obtained between the
difference in number of errors be- 
tween groups on an individual choice 
point and the difficulty of that point. 
Although these results tend to con- 
firm theoretical expectation, some 
discrepancy between prediction and 
the experimental findings occurred 
on the easiest choice points. In each
investigation, the small number of
errors on the easiest two or three
points suggests the presence of few
interfering tendencies so that the
anxious might be expected to be
superior in performance. Even here,
however, they tended to be inferior.

In addition to the two studies
utilizing extreme groups, one study of
stylus maze learning involving the
entire range of anxiety has
been reported. After splitting a
randomly selected group of college
students into 7 anxiety groups according
to their MAS scores, Matarazzo et al.
(24) found a linear relationship
(r = .25) between anxiety and trials
to the criterion on the maze.
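
The rank-order analysis used in the two extreme-group maze studies can be sketched as follows. This is a minimal illustration, assuming a Spearman-type coefficient computed as a product-moment correlation on ranks; the function names and the data are hypothetical and are not taken from Taylor and Spence (43) or Farber and Spence (8).

```python
from statistics import mean


def average_ranks(values):
    """Rank values from 1 to n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        tied_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = tied_rank
        i = j + 1
    return ranks


def rank_order_correlation(x, y):
    """Product-moment correlation computed on the ranks of x and y."""
    rx, ry = average_ranks(x), average_ranks(y)
    mx, my = mean(rx), mean(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    return cov / (var_x * var_y) ** 0.5


# Hypothetical data for six choice points.
difficulty = [4, 9, 15, 22, 30, 41]     # total errors made at each choice point
group_difference = [1, 3, 2, 5, 8, 7]   # anxious minus nonanxious errors
rho = rank_order_correlation(difficulty, group_difference)
```

A coefficient near +1 would mean that the inferiority of the anxious group grows with the difficulty of the choice point, which is the pattern the theory predicts.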

While the investigations reported
above have found differences between
anxiety groups on maze performance,
Hughes, Sprague, and Bendig (14),
utilizing extreme groups, failed to
duplicate these results with several
serial verbal mazes. Different from
the Taylor and Spence study in which
the typical 2-second rate of stimulus
presentation was employed, Hughes
et al. used a 4-second rate in all cases.
Previous investigations have
demonstrated (12) that performance is
positively related to the interstimulus
interval in serial learning but since the
effects of this variable are poorly
understood, the implications of the
failure to find differences between
anxiety groups with the 4-second
condition are not clear. One possibility,
based on the assumption that differences
in anxiety level are largely
determined by situational factors, is
that under longer time intervals,
stress upon Ss, and hence upon
differences in emotionality between
anxious and nonanxious, is minimized.

Verbal learning. Rather than
attempting to demonstrate an interaction
between anxiety level and degree
of interference by examining individual
items within a single task, as was
done in the maze studies, Montague
(25) formed three different lists of
serial nonsense syllables which,
because of varying degrees of formal
intralist similarity and association
value of the syllables, presumably
differed in the amount of intralist
interference. A significant interaction was
found between anxiety and list, an
anxious group being significantly
superior in performance to nonanxious
on the list for which similarity was
low and association value high, and
the position being reversed for groups
given a list of high similarity and low
association value. Similar findings
have been reported by Lucas (19) in
a study in which Ss were asked to
recall lists of consonants read to them.
As the number of duplicated consonants
within a list was increased,
anxious Ss showed a significant decrease
in the amount recalled while the
performance of the nonanxious was not
affected.

While a number of investigators
have employed serial learning tasks,
from the point of view of testing the
drive theory, the
paired-associate technique seems to
be preferable. Whereas intralist
interferences due to such factors as
remote associations are inherently part
of serial learning and are thus difficult
to manipulate, the use of discrete
S-R pairs permits more precise control
of the number and strength of the
response tendencies elicited by each
stimulus. Turning to the investigations
that have employed this
paired-associate arrangement, several studies
have attempted to minimize the
presence of competing response
tendencies and thus to demonstrate the
performance superiority of anxious
Ss. In one, Taylor and Chapman
(42) chose nonsense syllables with
low formal similarity, in an attempt
to provide a noncompetitional
arrangement in which each stimulus
tended to evoke only its own
response. As expected, on two lists for
which such low similarity obtained,
anxious Ss were significantly superior
in performance to nonanxious. Similar
superiority of anxious Ss has been
reported by Spence (33) on an adjective
list in which the association
between each S-R pair was presumed to
be initially strong and minimum
similarity existed among pairs. In a second
part of this investigation, an
attempt was made to maximize the
number of competing tendencies by
having a high degree of synonymity
among stimuli. As predicted, an
anxious group in this case was inferior.

The initial strength of association
between S-R was also manipulated
by Ramond (26) in an investigation
involving a variation of the standard
paired-associate technique. Each
stimulus, an adjective, had connected
with it two response words, one
judged to be highly associated with
the stimulus and the other with no
discernible association. Each type of
response was correct for half of the
items. When the low association
responses were correct, anxious Ss were
expected to perform at a lower level
than nonanxious because of the
greater interference of the strong,
incorrect response for this group. The
results confirmed this prediction.
Theoretical expectations for the
situation in which the stronger response
was correct are not so clear-cut since
the arrangement of the list made it
likely that as learning took place the
low association responses would
interfere occasionally with the high
association response because of stimulus
generalization. Thus, while anxious
Ss might be expected to be superior
early in learning, they might lose this
superiority as the weak responses are
learned and provide competition.
The results lent some support to these
expectations, anxious Ss first being
superior and then inferior to
nonanxious although the over-all difference
between groups did not reach
statistical significance.


ANXIETY SCORES AND THEIR 
RELATIONSHIP TO STRESS 


As was indicated earlier, two alter- 
native hypotheses have been enter- 
tained concerning the difference be- 
tween Ss scoring high and low on the 
MAS with respect to anxiety: that 
such groups have different levels of 
chronic anxiety or that the groups 
instead differ in their emotional 
reactiveness to anxiety-evoking stim- 
uli present in a situation. 

The studies of verbal learning just 
discussed indicate that whether due 
to chronic or situational factors, dif- 
ferences between high and low scor- 
ing Ss cannot be said to be produced 
only when stress is deliberately in- 
troduced into the situation, either by 
means of noxious stimulation as in 
the case of defense conditioning or by 
the administration of stress-provok- 
ing instructions (e.g., reports of fail- 
ure). Consideration of the studies 
into which some threatening stimula- 
tion has been introduced may, how- 
ever, throw some light onto the ques- 
tion as to whether differences in anx- 
iety among groups could depend, at 
least in part, on situational variables. 

Should situational factors play a 
role in determining differences in 
emotionality among anxiety groups, 
the strength of the UCS in classical 
conditioning might be expected to be 
related to such group differences. A 
comparison of three experiments
from the Iowa
laboratory involving a relatively
strong, medium, and mild UCS,
respectively, was made by Spence and
Farber (35). Examination of the
mean conditioning scores reveals that
while intensity of the UCS tended to
be related to performance, the
magnitude of the difference between
anxious and nonanxious remained
relatively constant under the different
intensities. Different results were
obtained by Spence and his associates
(37) in a study specifically undertaken
to evaluate the effect of the
strength of noxious stimulation on
anxiety groups. In this investigation
the Ss, selected without reference to
their anxiety scores, were conditioned
with a relatively weak UCS, but one
group was given occasional electric
shocks between trials, another
threatened with shock, and a third trained
under neutral conditions. These
latter Ss, run under neutral conditions,
gave fewer CR’s than the other
groups, especially in earlier trials.
When Ss were later divided into the
upper and lower 50 per cent according
to anxiety scores, it was found
that while the anxious group
conditioned without shock or threat
of shock exhibited only a slight,
statistically insignificant superiority in
conditioning performance, the difference
between groups was
highly significant for Ss with whom
shock or threat of shock was employed.

The previously mentioned studies
of stimulus generalization by
Rosenbaum (28) and Wenar (48) were also
concerned with variations in the
intensity of noxious stimulation, in
both cases a buzzer and two intensities
of shock being employed. While
Rosenbaum found a significant
difference between groups only when
strong shock was used, Wenar’s
results (with a somewhat different
experimental arrangement) indicated a
greater responsiveness for the
anxious group under all three conditions.
Furthermore, the magnitude of the
difference between groups was
unaffected by stimulus intensity.

Turning to verbal learning, Deese,
Lazarus, and Keenan (4) have
reported a study in which the effect of
electric shock on serial learning was
investigated. Here it was found that
nonanxious groups given intermittent
shocks performed at a significantly
lower level than a nonanxious control
group run under neutral conditions.
In contrast, the performance of the
anxious groups remained relatively
constant, Ss run under shock not
differing from their control group.
Further, when all conditions were
combined, the performance of the anxious
was significantly superior to the
nonanxious.³ Thus, while the differences
between groups increased under
shock, they were due to the disruptive
effect of the shock on the
nonanxious Ss.

³ Although, presumably, the serial list was
of relatively low intralist similarity, it is
difficult to tell from the writers’ description
what drive theory would have predicted
concerning the performance of the anxiety
groups, independent of the stress factor. In a
second, parallel, experiment involving a
more difficult list (12 consonant syllables
composed of only 5 consonants) presented for
a standard 12 trials, Lazarus, Deese, and
Hamilton (17) found no differences among
groups either as a function of anxiety scores
or of shock-no-shock conditions. While
these results appear superficially to be
contradictory both to drive theory (which would
expect inferiority of anxious Ss) and to the
results of the first study with respect to the
influence of shock, inspection of their data
indicates that all groups averaged only about
one correct response per trial. Since so little
learning took place it is not surprising to have
no differences in performance among groups.
For this reason it is felt that the study does
not provide very meaningful evidence on the
effects of either anxiety level or shock on task
performance.

Quite in contrast to the results of
Deese et al. are the findings obtained
by Gordon and Berlyne (10) in an
investigation of verbal learning utilizing
psychological stress rather than
noxious stimulation. After being told
that the tasks were measures of
intelligence and that their performance
on a paired-associate list was above
average, anxious and nonanxious
groups did not differ significantly in
amount of negative transfer on a second
paired-associate list. An anxious
group told that their first list
performance was below average, however,
exhibited significantly more
negative transfer than did a comparable
nonanxious group. Finally, in
the Lucas study (19) mentioned earlier
in which the recall of consonant
lists varying in number of duplications
was investigated, the effects of
varying numbers of reports of failure
to meet expected standards were also
studied. While nonanxious Ss
increased the amount recalled with
greater numbers of failure experiences,
the anxious groups did significantly
worse.

As may be seen, the available
evidence does not present a clear-cut
picture with respect to the effects of
stress. Summarizing first those
investigations involving noxious
stimulation, the results indicate that with
one exception (4) the performance of
all Ss tends to be affected in the same
direction as is found with an increase
in anxiety (MAS) level. The magnitude
of the difference between
anxious and nonanxious Ss either
remains constant with greater degrees
of stimulation or is increased. The
data from the two studies employing
psychological stress (in both cases
defined by telling S he had failed to
achieve adequate standards on an
intelligence test) have revealed somewhat
different relationships. In both
instances (10, 19) the performance
of anxious Ss under stress was
significantly worse than the anxious
group tested under neutral conditions
while the performance of
nonanxious Ss was in one case the same
and in the second better than the
control group. Thus, the magnitude
of the difference between anxiety
groups was greater under stress than
under neutral conditions.
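
The statement that the difference between anxiety groups was greater under stress than under neutral conditions describes an interaction, which can be expressed as a difference of differences. The sketch below uses hypothetical cell means (higher scores meaning better performance) that merely reproduce the direction of the pattern described; they are not data from Gordon and Berlyne (10) or Lucas (19), and the function names are assumptions of this example.

```python
# Hypothetical mean performance scores; illustrative only.
means = {
    ("anxious", "neutral"): 10.0,
    ("anxious", "stress"): 7.0,
    ("nonanxious", "neutral"): 11.0,
    ("nonanxious", "stress"): 12.0,
}


def group_gap(cell_means, condition):
    """Nonanxious minus anxious mean under one condition."""
    return cell_means[("nonanxious", condition)] - cell_means[("anxious", condition)]


def interaction_contrast(cell_means):
    """Difference of differences: change in the group gap from neutral to stress."""
    return group_gap(cell_means, "stress") - group_gap(cell_means, "neutral")


gap_neutral = group_gap(means, "neutral")
gap_stress = group_gap(means, "stress")
contrast = interaction_contrast(means)
```

A positive contrast corresponds to the reported pattern, the gap between groups widening under stress because the anxious Ss decline while the nonanxious hold steady or improve.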

The available evidence suggests,
then, that situational sources of stress
may play a role in determining the
difference in anxiety level between Ss
scoring at the extremes of the MAS.
Whether the differences between
groups in the verbal learning studies
into which no objective stress had
been introduced by the experimenter
reflect chronic anxiety level or
unidentified sources of threat remains
an open question. Speculating on
this point, to many college sophomores
psychology experiments per
se may be seen as somewhat threatening,
particularly when the task could
be interpreted as reflecting on their
personality or intelligence. It is
perfectly possible that in experimental
arrangements involving no noxious
stimulation or stress-inducing
instructions which call upon skills not
particularly valued by college students,
differences between groups
might disappear.⁴

⁴ A study of classical reward conditioning of
the salivary response by Bindra, Paterson,
and Strzelecki (On the relation between
anxiety and conditioning, Canad. J. Psychol., 1955,
9, 1-6) which appeared after this review was
written confirms this suggestion. No difference
was found between anxious and nonanxious
groups.

Using the results of these studies
involving stress to attempt to determine
the source of anxiety differences
between high- and low-scoring Ss or,
for that matter, to test drive theory,
involves the assumption that the only
effect of stress in any situation is to
increase drive level or, at least, that
anxious and nonanxious groups do
not respond differentially to stress
except with respect to anxiety or
drive. Although no systematic
exploration has been made of the
relationship between degree of noxious
stimulation and performance on various
types of tasks, an examination of
the general literature concerning the
effect of such stimulation in nonverbal,
noncompetition situations lends
some credibility to this assumption
(32). It is important to note that
with one exception (4) the studies of
the effects of noxious stimuli on
anxious and nonanxious Ss have
employed tasks of this type.

In contrast, the literature concerning
studies of psychological stress
(e.g., ego-involving instructions,
reports of failure), most of which have
employed quite complex tasks,
suggests that factors other than or in
addition to drive level are involved.
The variety of roles or effects that
stress may have in addition to the
motivational one has been discussed
by Lazarus, Deese, and Osler (18)
and more recently by Farber (7).
Particularly pertinent to the present
discussion is the finding that there
are wide individual differences in
response to such stress, some individuals
improving in performance, others
decreasing, and still others being
unaffected. The direction of the effect
of stress has further been related to
several personality variables (18).
The Ss scoring at the extremes of the
MAS continuum may react to such
stress with characteristically different
patterns as well. Thus, it is possible
that with increasing degrees of
stress, differences between anxious
and nonanxious other than drive may
be aroused and become responsible,
at least in part, for the discrepancy
between the performance levels of
such groups.

Unfortunately, the two available
studies involving psychological stress
do not permit an evaluation of this
suggestion (nor of the possibility that
stress of any type, physical or
psychological, may have a similar effect in
tasks of sufficient complexity). Both,
it will be recalled, used learning tasks
of such a type that an increase in
drive level might be expected to
result in deterioration of performance.
Thus, it could be argued that the
anxious were “threatened” (had their
drive level increased) by the stress
instructions and hence deteriorated in
performance in comparison to their
neutral control group while the fact
that the nonanxious under stress did
not show a similar inferiority merely
indicates that they were emotionally
unaffected by the stress conditions.
The only hint that more might be
involved than drive level is contained
in the Lucas study in which
nonanxious improved with a greater
number of failure experiences while
the anxious became worse. Such a
finding suggests further that these
additional factors, if any, might act
in the direction of interfering with the
performance of anxious Ss and of
facilitating the performance of
nonanxious. Additional research upon
the effects of stress on anxiety groups,
particularly with tasks of different
levels of complexity, is certainly
needed to provide information about
these possibilities.

The suggestion that at least psy- 
chological stress may have other than 
drive effects on anxious and nonanx- 
ious Ss in complex tasks bears some 
resemblance to the empirical predictions
proposed by Sarason and Mandler
and their associates (22, 23, 29)
for the performance of groups selected
by a different measuring instrument,
a questionnaire of “test anxiety,”
designed to select individuals reacting
with different degrees of anxiety
to intelligence tests and course
examinations. These investigators
hypothesized that such high-anxious
individuals react to an experimental
situation represented as a test of
intelligence or the like (thus, according
to their conception, creating stress)
not only with more anxiety or drive
than low-anxious but also, as a result
of past learning, have evoked by
their anxiety irrelevant response
tendencies which interfere with task
performance. Under increasing stress
(such as reports of failure) the
performance of high-anxious Ss worsens
because of the arousal of a greater
number of these irrelevant tendencies,
offsetting the facilitating effects
of drive; the performance of the
low-anxious, however, improves with
greater stress due to an increasing
drive level, unaccompanied by
irrelevant tendencies. Such a theory,
although predicting the same results as
would be expected from the notions
being put forward here about the
effect of stress on the performance of
anxious and nonanxious in complex 
tasks, differs from these suggestions 
in several ways. In contrast to drive 
theory, Sarason and Mandler seem 
to imply that other things being 
equal, heightened drive always results 
in raising performance, independent 
of the type of task involved. Further 
they propose that the effect of stress 
is to evoke certain disruptive re- 
sponse patterns in addition to drive 
only for high-anxious Ss while the 
suggestion of the present writer is 
that additional factors may be elic- 
ited under stress for both anxiety ex- 
tremes although their effects on per- 
formance may be in the opposite 
direction. 

Although Sarason, Mandler, and their
colleagues have confined their interests
to “test anxiety” and its effects,
primarily, on intelligence-test items
under stressful conditions, Child (3) 
has proposed that all the work done 
with Ss scoring at the extremes on 
the MAS, independent of whether 
stress is introduced, could be more 
plausibly explained by such an inter- 
ference theory. These task-irrelevant 
responses are always present in anx- 
ious Ss, as well as a higher drive level, 
Child states, but they disrupt per- 
formance only in complex situations 
“where the subject is already in con- 
flict between various response tend- 
encies relevant to the task [so that] 
the of irrelevant 
tendencies heightens the conflict and 
interferes with performance to a 
greater extent than increased drive 
improves it” (3, p. 154).

It would appear to the present
writer that a theory that attempts to
attribute all inferiority of performance
to irrelevant tendencies would
either be forced to predict that anx- 
ious Ss would always be inferior to 
nonanxious in such complex tasks as 
verbal learning (since it seems hard 
to maintain that even with verbal 
materials having little intratask in- 
terference, irrelevant extratask re- 
sponses could not interfere with per- 
formance) or, if already obtained re- 
sults are to be explained, that anxiety 
level and its correlated irrelevant re- 
sponse tendencies would shift up and 
down abruptly from task to task and 
even from stimulus to stimulus within 
a task as the number of competing 
response tendencies directly elicited 
by a stimulus varied. Tying the
number of extratask responses to the
number of intratask responses
would seem merely to be adding one
more variable to those considered by 
drive theory without making differ- 
ent predictions in the situations to 
which drive theory has been thought 
to be applicable.

It is interesting to note that the
suggestions being proposed here
concerning the possible role of response
as well as drive differences in the
performance of anxious and nonanxious
Ss in stress situations lead to a
different prediction than do Child’s
hypotheses in certain cases. According
to the present writer, on verbal
tasks in which anxious Ss are
demonstrated to be superior to nonanxious
under neutral conditions, the
introduction of stress might be expected
to minimize this difference between
groups or even to reverse its
direction, the performance of anxious
Ss being lower than under neutral
conditions and the nonanxious possibly
higher. Child, while perhaps
also expecting nonanxious Ss to
do better under stress than under
neutral conditions, would be forced
to predict that the anxious group
under stress would be the same as or
even superior to its neutral control
group rather than worse. That is, the
fact that under neutral conditions the
anxious Ss perform at a higher level
than the nonanxious would indicate,
according to Child, that this was a
situation in which making irrelevant
responses does not interfere with task
performance, the difference between
groups in favor of the anxious being
due, then, to their higher drive.
While stress might increase the drive
level of anxious Ss and hence the
magnitude or number of the
task-irrelevant responses, these latter
would still not compete with
task-relevant responses since the task is
the same.

Still another interpretation of the 
relationship between anxiety and 
stress has been suggested, the pre- 
dictions of which are quite opposed 
to any of those previously discussed. 
On the basis of their findings with 
serial learning that the performance
of nonanxious groups deteriorated
with shock while that for the anxious 
did not, Deese, Lazarus, and Keenan 
(4) suggested that the MAS measures 
not so much anxiety as how indi- 
viduals defend themselves against 
anxiety, and further, that MAS scores 
are related to the hysteria-psychasthenia
continuum. The latter
proposal arose from the finding that
(with overlapping items excluded)
there was a positive correlation of
.40 between the MAS and the
Psychasthenia (Pt) scale on the MMPI
and a -.23 correlation between the
MAS and the Hysteria (Hy) scale.
By assuming that nonanxious Ss are 
hysterical individuals who are unable 
to maintain their defenses in the face 
of objective inescapable stress (e.g., 
shock, as opposed to psychological 
stress), and therefore are greatly dis- 
turbed by it while the anxious are 
psychasthenic and therefore react to
objective threat coolly and intellectu- 
ally, they believe their results become 
intelligible. The same explanation 
has been offered by Eriksen (5), who 
found that Ss scoring high on the Hy 
scale exhibited more stimulus gen- 
eralization in an investigation involv- 
ing shock than did high Pt Ss. These 
results, Eriksen stated, were inex- 
plicable in terms of drive theory. In 
attempting to evaluate these hy- 
potheses (and leaving aside any ques- 
tions of the clinical validity of the 
various measures employed) it might 
be well to inject a historical note. In 
developing a scale for the selection of 
Ss, the present writer deliberately 
attempted to include items descrip- 
tive of overt or manifest anxiety and 
avoided including items describing 
behavior not itself "anxious" but
said to be a defense against an in- 
ternal anxiety precisely because it 
was the purpose of the scale to select 
Ss differing in functioning anxiety
level in the experimental situation;
to the extent that defenses were effective
in keeping anxiety at a minimum,
inclusion of "defense items"
on the scale would have been
self-defeating.

The conflict between the hypothesis
of Eriksen, Deese, et al., and the
assumptions made by drive theorists
in using the MAS is not whether some
individuals scoring low on the scale
are potentially anxious individuals
with good defenses, but rather
whether the introduction of special
conditions such as shock so affects a
sufficient number of low scoring Ss
as to wipe out or reverse the direction
of difference in drive or emotionality
between low- and high-scoring
groups that exists under neutral
conditions. If Ss are thus affected,
drive theorists must either abandon
the MAS for a different selective
instrument, or restrict themselves to
testing groups in situations in which
defenses are assumed to be operating.

An examination of the available
evidence suggests that no modification
of the postulated relationship
between anxiety scores and drive
level need be made at the present
time (if it is understood that the pur- 
pose of drive theory is to investigate 
the effects of drive once in operation 
rather than the development of a 
comprehensive theory of anxiety as a
personality phenomenon). That is,
the results of Deese et al. (4) seem
deviant; no other investigation involving
noxious stimulation (since
psychological stress does not assault
hysterical defenses) has obtained results
that would be expected if the
anxiety level of low scoring Ss increased
up to or beyond that of the
high scoring Ss. If such stimulation
has any differential effect at all, it
appears to be in the direction of increasing
the anxiety of the anxious
group proportionately more than the
nonanxious. Examining the Eriksen
results and accepting them as reli- 
able, there seems to be no firm basis 
for suggesting that drive theory 
would have predicted more stimulus 
generalization for the high Pt group 
than the high Hy. Such a claim rests
on the assumption that all nonanxious
Ss would be low Hy and all anxious
high Pt. The magnitude of the
reported correlation coefficients, particularly
between the MAS and the
hysteria scale, does not make this
assumption seem too reasonable.
Even if high Hy Ss do become disturbed
under nonescapable stress, a
sufficient number of Ss could remain
in the nonanxious group who were
"genuinely" nonanxious, or whose defenses
remained intact, to have a nonanxious
group exhibit less stimulus
generalization than the anxious.
More relevant than such armchair
argument, however, are Rosenbaum's
(28) results. Using an experimental
arrangement very similar to Eriksen's,
Rosenbaum found, it will be
recalled, more stimulus generalization
for anxious than nonanxious,
and even more important, that the
difference between groups was significant
only under the conditions of
strong shock.


MAS AND CLINICAL MEASURES
OF ANXIETY


As was indicated earlier, the meaning
of the term "anxiety" as used in
the studies attempting to determine
the relationship between drive and
performance has been only in terms
of MAS scores. While such pure
operationism is methodologically
sound, the generality of these results
would be considerably expanded were
a relationship established between
the MAS and more common clinical
definitions of anxiety. Most valuable
would seem to be a comparison of
scale scores with observers' ratings
of overt behavior since other diagnostic
tests of anxiety are themselves
purported to be indicators of such
behavior. Fortunately, several studies
relating MAS scores and observational
data have been carried out.
In the first of these investigations,
reported by Gleser and Ulett (9) of
Washington University, a psychiatrist
rated 151 normal individuals
and 40 psychiatric patients with
overt anxiety as a prominent symptom
after an hour interview with each
subject. Ratings were made on an
8-point scale of anxiety-proneness, defined
as the tendency for overt anxiety
symptoms to appear in a stressful
situation. For the total group the
correlation between these ratings
and MAS scores was .61. Other similar
studies by the Washington group
(45, 46) with more restricted samples
indicated lower coefficients. In a
study of 110 male students, involving
the judgments of two psychiatrists,
the ratings correlated .28 and .29
with MAS scores for the two raters,
while the interjudge reliability was
.28 (46). All correlations were significant.
Lastly the Washington
group reported a coefficient of .40 between
the ratings of a single psychiatrist
and anxiety scores for 141 normal
Ss (45).

Operating in a student-counseling-center
setting, Hoyt and Magoon
(13) asked experienced counselors to
rate their own clients (N = 289) into
one of three groups: high, medium,
or low manifest anxiety. Comparing
the mean MAS scores for each of the
resulting anxiety groups, an extremely
significant chi square was
found, while the contingency coefficient,
used as an estimate of the r to be
expected if the variable had been
continuous, was .47. Using a still different
criterion of clinical anxiety,
Kendall (16) had pairs of nurses rate
TB patients on their ward on a
7-point rating scale for each of nine
aspects of manifest anxiety. Selecting
from the 93 patients so rated the upper
and lower 27% in terms of MAS
scores, Kendall compared the difference
in mean over-all anxiety ratings
for the two groups and found it to be
statistically insignificant; taking only
the upper and lower 13% on the
MAS, a very significant t between
mean ratings was obtained.
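
The relation between a chi square and a contingency coefficient can be written as C = sqrt(chi2 / (chi2 + N)). The sketch below assumes the uncorrected Pearson form; the text does not say whether a correction was applied and reports no chi-square value, so the printed chi square is only what that formula would imply for the quoted N = 289 and C = .47. The function names are illustrative.

```python
import math


def contingency_coefficient(chi_square: float, n: int) -> float:
    """Pearson's C = sqrt(chi2 / (chi2 + N)); uncorrected form (an assumption)."""
    if chi_square < 0 or n <= 0:
        raise ValueError("chi_square must be >= 0 and n must be > 0")
    return math.sqrt(chi_square / (chi_square + n))


def chi_square_from_c(c: float, n: int) -> float:
    """Invert the formula above: chi2 = N * C^2 / (1 - C^2)."""
    if not 0 <= c < 1 or n <= 0:
        raise ValueError("c must lie in [0, 1) and n must be > 0")
    return n * c * c / (1.0 - c * c)


if __name__ == "__main__":
    # N and C are the figures quoted in the text; the chi square is implied,
    # not reported, and holds only under the uncorrected-formula assumption.
    implied = chi_square_from_c(0.47, 289)
    print(f"implied chi square: {implied:.1f}")
    print(f"round-trip C: {contingency_coefficient(implied, 289):.2f}")
```

Because C cannot reach 1.0 for a table with few categories, a corrected coefficient would imply a somewhat different chi square; the sketch is meant only to show the form of the relation.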

Finally, a study by Buss, Wiener, 
Durkee, and Baer (2) represents one 
of the few investigations utilizing 
hospitalized psychiatric patients.
Each of their 64 patients was interviewed
and then rated by four psychologists
on nine aspects of directly
observed and reported anxiety. Correlations
between judges' pooled ratings
and MAS scores ranged from
.16 to .68 for these various aspects;
the correlation with an over-all rating 
of anxiety was .60. 

The variation in the training of the 
raters, opportunity for observation, 
rating scales, and populations from 
which the subjects were drawn makes 
it difficult to formulate any statement 
about the "validity" of the MAS. To
the extent that all of these observational
criteria are themselves correlated
and are agreed to be clinically
acceptable indices of manifest anxiety,
there does seem to be some relationship
between MAS and observed
behavior. These results suggest,
then, that the experimental results
obtained with the anxiety scale might
also hold for groups selected according
to clinical criteria. Such studies
as have been reported about the per- 
formance of clinically selected anx- 
ious groups on comparable tasks tend 
to confirm this suggestion (1, 20, 30, 
47).

In addition to the experimental
studies of the performance of anxious 
and nonanxious groups already dis- 
cussed, a number of other investiga- 
tions have reported differences in the 
behavior of anxious and nonanxious 
Ss, ranging from indications of num- 
ber of food aversions (31) to per- 
formance in problem-solving tasks 
(21,49). The exclusion of these many 
experiments from consideration here, 
due to the limited purpose of this 
paper—that of assessing the evidence 
directly relevant to drive theory— 
points up what has not always been 
fully appreciated about this theory. 
It is an extremely restricted one, re- 
ferring only to the effects of drive 
level (rather than all characteristics
of anxious and nonanxious individuals)
in relatively simple learning
situations. The major prediction of 
the theory, that there is an interac- 
tion between anxiety level and task 
complexity, seems to be fairly well 
substantiated by experimental evi- 
dence, although more exact deduc- 
tions have either not been tested as 
yet or have not fared as well. 
Whether the theory can be success- 
fully applied to more complex situa- 
tions than those for which it origi- 
nally seemed appropriate, as some 
have attempted to do, or whether 
additional variables can be added to 
it and thus broaden its usefulness remains
for future research to determine.


BEHAVIORAL EFFECTS OF IONIZING RADIATIONS
ERNEST FURCHTGOTT¹
University of Tennessee


Psychology has not been by- 
passed in the current general interest 
in ionizing radiations. Since World
War II a number of laboratories
maintained by the U.S. government
have conducted research in this area.
In addition, several research projects
have been sponsored by government
agencies in non-Federal institutions.
On March 31, 1955, there were in
progress no less than seven such
separate projects having no security
classification (50). All of this activity
would seem to warrant a brief review
of the problem.


The Stimuli 


The biological effects of high energy
radiations are ascribable primarily
to changes brought about in
cells by ionization, defined as the
removal of electrons from atoms.
Different types of radiations produce
biological effects differing primarily
quantitatively, rather than qualitatively.
Two general classes of radiations
may be distinguished.

1. Material radiations consist of
streams of particles which transfer
their kinetic energy to the targets
which they strike. The particles, differing
in mass and/or electrical
charge, are neutrons, alpha particles,
electrons (beta particles), deuterons, or
protons. These radiations have been
utilized only very rarely in behavioral
studies.

2. Electromagnetic radiations consist
of oscillating electric and magnetic
fields. They also display
corpuscular (photon) properties. Psychologists
are familiar with "light"
rays which lie in the frequency range
of 10" cycles per second (wavelength
range 9X10-°—410-° cm.).
Radiations above 10'* cycles per second
are capable of ejecting inner
electrons from atoms. Radiations in
the 10'*— 10°" cycles per second range
(10-*—10-' cm.) are called X rays;
those between 10'°—10" cycles per
second (10-*—10-" cm.) gamma rays
(the latter are usually produced by
oscillating currents within the atomic
nuclei themselves). Gamma rays
often accompany the disintegration
of radioactive substances.

¹ I wish to express my gratitude to Dr. S. R.
Tipton of the U.T. Department of Zoology
for critically reading portions of the manuscript.
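
The frequency and wavelength figures given for each band of electromagnetic radiation are two views of the same quantity, related by wavelength = c / frequency. The following is a minimal sketch of that conversion in the centimeter units used here. It assumes c is about 2.998 x 10^10 cm per second; the function names and the 4 x 10^-5 cm example (roughly the violet end of visible light) are illustrative and not taken from the text.

```python
# Speed of light in centimeters per second (wavelengths here are in cm).
SPEED_OF_LIGHT_CM_PER_S = 2.998e10


def wavelength_cm(frequency_cps: float) -> float:
    """Wavelength in cm for a frequency in cycles per second."""
    if frequency_cps <= 0:
        raise ValueError("frequency must be positive")
    return SPEED_OF_LIGHT_CM_PER_S / frequency_cps


def frequency_cps(wavelength_in_cm: float) -> float:
    """Frequency in cycles per second for a wavelength in cm."""
    if wavelength_in_cm <= 0:
        raise ValueError("wavelength must be positive")
    return SPEED_OF_LIGHT_CM_PER_S / wavelength_in_cm


if __name__ == "__main__":
    # Illustrative value only: violet light near 4e-5 cm (400 nm).
    violet_cm = 4e-5
    print(f"{violet_cm:.1e} cm -> {frequency_cps(violet_cm):.2e} cycles per second")
```

Since the product of frequency and wavelength is constant, each tenfold increase in frequency corresponds to a tenfold decrease in wavelength.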

The relative biological effectiveness
of various radiations is a function not
only of the total number of ions
formed, but also of the spatial distribution
of the ions in the tissues.
The terms linear ion density or linear
energy transfer are used to express
the relative density of ionization
per unit length of tissue. Beta and
gamma rays produce 6.3-11 ions
per micron of tissue, 1,000 kv. X rays
approximately 15, 200 kv. X rays 80,
and lower voltage X rays a still
higher number. Ionization following
neutron radiations produces up to
9,000 ions per micron of tissue (26,
p. 118). Depending on the function
studied, biological effectiveness of
radiation may increase with, decrease
with, or be independent of linear
energy transfer.
Thus some activities are affected
more by alpha particles than by
gamma rays, while in other functions
the reverse may be the case. In mammals
we usually find that effectiveness
increases with ion density.


Measurement of Radiations

Ideally we would like to measure 
the actual amount of ionization in 
tissues, but this is not possible. We 
must be satisfied with specifying the
physical characteristics of the source
and the target. Ionization of air is
actually used as an approximation for
ionization in tissues. In the case of X and
gamma rays the unit roentgen, r., is
defined as that quantity or dose of
X or gamma radiation which produces
in 0.001293 g. of air one electrostatic
unit of ions (37, p. 90).
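
The roentgen definition can be turned into a count of ion pairs by dividing one electrostatic unit of charge by the charge carried by a single ion. The sketch below assumes singly charged ions and uses standard values for the electrostatic unit (about 3.336 x 10^-10 coulomb) and the elementary charge (about 1.602 x 10^-19 coulomb); these constants and the function names are not from the text.

```python
# Assumed physical constants (not given in the text).
ESU_IN_COULOMBS = 3.33564e-10       # one electrostatic unit of charge
ELEMENTARY_CHARGE_C = 1.602177e-19  # charge of one singly charged ion
AIR_MASS_G = 0.001293               # mass of air in the roentgen definition


def ion_pairs_per_roentgen() -> float:
    """Ion pairs formed in 0.001293 g of air by one roentgen."""
    return ESU_IN_COULOMBS / ELEMENTARY_CHARGE_C


def ion_pairs_per_gram_of_air() -> float:
    """The same quantity expressed per gram of air."""
    return ion_pairs_per_roentgen() / AIR_MASS_G


if __name__ == "__main__":
    print(f"ion pairs per roentgen (0.001293 g air): {ion_pairs_per_roentgen():.3e}")
    print(f"ion pairs per gram of air: {ion_pairs_per_gram_of_air():.3e}")
```

Under these assumptions one roentgen corresponds to roughly 2 x 10^9 ion pairs in the defining mass of air, or on the order of 10^12 ion pairs per gram.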

In the case of material radiations
a different unit is used, the roentgen
equivalent physical, rep, which is
"that quantity of ionizing radiation
which will produce 1.6 × 10¹² ion
pairs per gram of tissue" (37, p. 436).
Occasionally, the roentgen equivalent
man, rem, unit is used, which is that
"quantity of radiation which when
absorbed by man produces an effect
equivalent to that produced by absorption
of one roentgen of X or
gamma radiation" (37, p. 436).

A few values will be cited to make
the roentgen unit more meaningful.
The human safe daily whole-body
exposure has been set between 0.05
and 0.25 r. per day (37, p. 436; 64, p.
89). The threshold for the mitotic
effect in the grasshopper is 8.0 r.
(64, p. 89). The 30-day 50 per cent
lethal dose after 100-250 kv. X-ray
whole-body exposure is about 315 r.
for the dog, around 500 r. for man
(55, p. 930).


GENERAL PRINCIPLES 
OF RADIOBIOLOGY 


It was pointed out previously that
radiation-induced effects result primarily
from ionization producing
physicochemical changes in the living
cells. Two general theories concerning
the mode of action of radiation
have been put forward. According
to the target theory certain molecules
of the cell are especially radiosensitive
and it is the change in these
specific parts which accounts for the 
observed radiation effects. Opposing 
theorists suggest that radiation af- 
fects the cell as a whole by releasing 
certain chemical agents which inter- 
fere with the normal cell metabolism. 
Actually there is evidence support- 
ing both viewpoints. For purposes 
of this review it is not necessary to 
examine this problem any further. 

The following variables are important
in the study of radiation effects
(55):

1. Quantity. In most cases
effects are directly related to the dose.

2. Rate of delivery or dosage (sum
of doses accumulated over a period
of time). In most cases effectiveness
of a given dose decreases with a decrease
in rate of exposure. Recovery
may account for this. For example,
in the monkey a single dose of 7,500
r. applied to the spinal cord produces
paraplegia, but when the exposure is
divided, two daily 5,000 r.
doses or five daily 3,000 r. doses are
required (48).

3. Type of radiation. Usually in
mammals effectiveness is directly
related to the specific ion density of
the radiation.

4. Manner of exposure. Effects due
to total-body irradiation are different
from those in which only a selected
part of the organism is exposed.
Shielding of certain parts of
the body (spleen, extremities, etc.)
can reduce the effectiveness of
total-body exposure. This is especially
important in the study of the
effects on the c.n.s. since doses larger
than the median lethal total-body
dose are necessary for changes to be
apparent.

5. Time after exposure that observations
are made. Many of the radiobiological
effects exhibit latencies. These
may be of varying orders of magnitude,
ranging from seconds to years.

6. Species differences. The 30-day
LD50² for total-body X irradiation for
the rabbit is approximately 800 r.,
for the guinea pig 200-400 r., rat
600-700 r., monkey 500 r. (55, p.
930).

7. Sex differences and individual 
differences within the same species. For 
example, the same dose of X rays 
kills more male than female mice, 
but affects the weight of females to a 
greater extent (7). 

8. Conditions of the organism. Conditions
which may be called "stress,"
i.e., deviation from normal resting
state, usually enhance effectiveness
of radiations. Vitamin deficiencies,
infections, low temperatures in unacclimated
animals, exhaustive exercises,
adrenalectomies all seem to
increase radiation effects.

9. Drugs and anoxia. Certain
drugs like cysteine, glutathione, and
alcohol, as well as anoxia, actually depress
radiation effects.

10. Reproductive activity of the tissues.
As early as 1906 Bergonié and
Tribondeau hypothesized that proliferating
tissues are usually most
radiosensitive. We find, for example,
that while the nervous system of
adult organisms is relatively radioresistant
the embryonic neurons are
extremely radiosensitive.

Radiation sensitivity varies considerably
from tissue to tissue. For
a detailed discussion the reader may
consult the radiation literature. We
shall mention here only a few of the
effects of interest to the psychologist.
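
The rate-of-delivery variable listed above can be made concrete by tallying the total dose each schedule in the spinal-cord example delivers. The sketch below uses only the three schedules quoted for the monkey (48); the class and function names are illustrative, and the ratios are simple arithmetic rather than a model of recovery.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Schedule:
    """One exposure schedule: a number of fractions of equal size."""
    label: str
    fractions: int
    dose_per_fraction_r: float

    @property
    def total_dose_r(self) -> float:
        return self.fractions * self.dose_per_fraction_r


# The three schedules quoted in the text as producing paraplegia.
SCHEDULES = [
    Schedule("single exposure", 1, 7500),
    Schedule("two daily doses", 2, 5000),
    Schedule("five daily doses", 5, 3000),
]


def relative_to_single(schedules):
    """Total dose of each schedule as a multiple of the first (single) one."""
    baseline = schedules[0].total_dose_r
    return {s.label: s.total_dose_r / baseline for s in schedules}


if __name__ == "__main__":
    for s in SCHEDULES:
        print(f"{s.label}: {s.total_dose_r:,.0f} r. in total")
    print(relative_to_single(SCHEDULES))
```

Spreading the exposure over two days raises the required total by a third, and spreading it over five days doubles it, which is the sense in which a given dose becomes less effective as the rate of delivery falls.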

The hematopoietic system is extremely
radiosensitive. A decrease
in the number of circulating lymphocytes
is one of the most sensitive indicators
of radiation overexposure.
Other blood components also show
pathological changes. Hemorrhagic
manifestations are also quite common
after acute irradiation. Vascular
changes are major contributors
to the brain pathologies observed
after large doses of irradiation (9).
Generalized circulatory changes are
only minor after median lethal doses,
but with larger doses the effects are
more pronounced (56).

² Dose required to kill 50 per cent of the
animals during the first 30-day postirradiation
period.

There is some disturbance in water 
metabolism. Several studies have 
reported changes in water intake 
after X irradiation (15, 53, 54). 

The gastrointestinal tract is extremely
radiosensitive. Anorexia,
nausea, and vomiting are among the
clinical symptoms of radiation sickness
(overexposure to radiation).
Depression of food intake and a loss
of body weight can be observed in irradiated
animals. The magnitude
and duration of the depression are a
function of the dosage (63). Loss of
body weight can thus be used as an
indication of radiation sickness.

The endocrine glands, except for
the gonads, are relatively resistant
to radiation damage. However, radiations
act as "stressors" and they
give rise to the well-known
pituitary-adrenocortical stress response (56).

The cornea, conjunctiva, and the
lens of the eye are also quite radiosensitive
but the latency of human
radiation cataracts may be measured
in terms of years (56).

Muscle is very resistant to radiation.
The nervous system is also relatively
radioresistant. Both will be
considered in greater detail further
on.


PRE- AND NEONATAL RADIATION 

An excellent review of the effects 
of prenatal irradiation has been writ- 
ten by L. B. Russell (61). 

One of the crucial variables in pre- 
natal irradiation is the stage at which 
exposure occurs. Russell (61) divides
the mammalian gestation period into
three stages: preimplantation, major
organogenesis, and period of the fetus.
In the rat these periods correspond
to the following postconception days:
0-7, 8-15, 16-term. During the preimplantation
period radiation produces
a high percentage of prenatal
deaths, but the survivors are usually
normal. Radiation during the
period of major organogenesis results
in lower prenatal mortality, but it is
the most sensitive period for the production
of morphological abnormalities.
Radiation during the period of
the fetus produces lesser changes.
Among the most sensitive systems
during the prenatal period is the central
nervous system. Russell (61)
quotes studies dating back to 1907
which show marked morphological
changes following X irradiation. In a
series of studies on rats and mice Hicks
(29, 30) showed that X irradiation
during different stages of the gestation
period affects different parts of the
nervous system. There seem to be
critical periods for abnormalities of
various types. Irradiation during the
first eight days of embryonic life
produced no effects on the n.s. of
surviving animals. Irradiation on
the ninth day resulted primarily in
anencephaly; on the tenth day in encephalocele
and cerebral deformation;
on the eleventh day it narrowed
the aqueduct, produced hydrocephaly
or encephalocele; on the thirteenth
to the sixteenth day the basal ganglia,
cortex, hippocampus, and corpus
callosum were damaged. From
the sixteenth day of gestation through
the neonatal period the cerebellum is
especially radiosensitive. Hicks emphasizes,
however, that the above
periods are only indices of the most
frequently occurring pathologies and
that there is no exact relationship
between age at irradiation and specific
malformations. Also it should
be pointed out that it is rather difficult
to determine the precise age of
embryos. Wilson and co-workers
(68, 69) have shown that neural dam- 
age is directly related to dosage. 
They irradiated rats on the ninth and 
tenth day of gestation with doses 
ranging from 25 to 400 r. On the 
ninth day 25 r. produced ocular mal- 
formation in only a small percentage 
of animals; 50 r. affected 72 per cent 
of the animals, 100 r. produced 
anophthalmia, microphthalmia, or 
other ocular malformations in 90 per 
cent of the animals; 200 r. proved 
fatal to all embryos. Brain damages 
showed similar trends. The data for 
the animals irradiated on the tenth 
day of gestation were similar, except 
that the doses required were higher. 
Fifty r. had little effect, but 100 r. 
resulted in anomalous eye development
in 75 per cent of the cases.

In this connection a study by Rugh 
et al. (59) is of some interest. Rat 
fetuses 13.5 days old were exposed to 
300 r. of X irradiation. In animals
examined four hours after exposure
the retinae revealed massive damage.
On the other hand, animals examined
six to seven days after birth had few
signs of injury. Apparently a recovery
process took place not by repair
of the damaged cells, but by
proliferation of the more radioresistant
precursor neuroectoderm cells.

There are a number of clinical reports
of various abnormalities such
as microcephaly, hydrocephaly, mental
deficiency, ocular malformations,
blindness and other types of neural
malformations which are ascribed to
fetal X irradiation (25, 49). Microcephaly
is the most frequently reported
abnormality, 17 out of 25
abnormal cases in one study (49). In
some clinical studies, however, no
damage is reported following pelvic
irradiation during pregnancy (61, p.
909). It is possible that the exposure
in the latter cases occurred after the
critical period. In the study of 30
pregnant women who showed one or 
more major signs of radiation follow- 
ing the Nagasaki atomic bomb blast, 
four out of sixteen surviving children 
showed signs of mental retardation 
(70). The report does not specify the 
nature or extent of the deficit. 

So far only two studies have been
reported which measured specific behavioral
consequences of prenatal
irradiation. Levinson (42) fetally irradiated
rats with 300 to 600 r. X
rays on the 11th, 13th, 15th, 17th,
and 19th postconception days. When 
the animals were 50 days old they 
were tested on a Lashley Type Il 
maze. Learning measured in terms
of number of trials necessary to reach
a criterion, number of errors, and
time spent in the maze was impaired
with the deficits directly related to
the radiation dose. Radiation on the
13th day produced the greatest
changes. This agrees roughly with
Hicks’ timetable for cortical damage
(29, 30). Variability was larger in the
experimental groups than in controls.
Tait et al. (65) X-irradiated
rats during the final week of pregnancy
using 30, 90, 180, and 360 r.
The offspring of the animals receiving
90 or more r. were significantly
poorer maze learners than control
animals.

Summary. While there is a great 
deal of evidence for the relative radi- 
osensitivity of the fetal nervous sys- 
tem, our behavioral data are rather 
scant. We do not know what kinds of 
activities aside from maze learning 
are affected nor the lower thresholds 
of radiation-induced changes. The 
latter may be of practical significance.


THE ADULT NERVOUS SYSTEM


It has been known for a long time
that the adult nervous system is relatively
radioresistant. Doses in the
median total-body lethal range pro- 
duce no observable neural changes. 
However, with larger doses, in the
case of mammals generally over 1,000
r., a number of investigators have obtained
definite signs of neural degeneration
in a variety of organisms:
man, monkey, dog, rat, rabbit, fish
(1, 2, 6, 9, 10, 12, 27, 31, 45, 46, 48,
59, 61). In general the amount of degeneration
observed is directly related
to the dose and conversely an
inverse relationship holds for the
latency (2, 9, 10, 12, 31, 59). With
relatively low doses, a few thousand
r., the latency may be a matter of
months, a year, or longer (2, 10, 27,
45). Many investigators assume that
the initially observed neuronal damage
is a secondary effect resulting
from damage to the vascular system
in the brain (6, 9, 27, 58, 60). Some
recent studies, however, deny the
necessity of this assumption (2). It
might be mentioned here also that
because of certain methodological
advantages the use of radioactive
cobalt has been proposed for the production
of circumscribed brain lesions
(62).

Aside from histological studies, we
have information on functional
changes. Reflex excitability decreases
as a function of dose, with high doses
abolishing the reflex completely (19,
20). Frequently enhancement precedes
the depression (2, 23, 39). But
again it should be emphasized that
median total-body lethal doses produce
no easily measurable changes
(13).

In a study in which the heads of
rabbits were irradiated using 12,500
r. (23), after a latent period of 30
minutes a convulsive phase with
grand mal seizures appeared. This
was followed by a somnolent phase
of two hours’ duration in which the
animals were quite inactive. Finally,
in the last stages before death, ataxia
was the most pronounced symptom. 
Changes in equilibrium and disorien- 
tation in space have been reported by 
a large number of investigators (1, 
10, 46, 52, 58, 60). This is in accord 
with several histological reports that 
the brain stem and cerebellum are 
the most frequent sites of radiation 
necrosis. Hemi- or quadriplegia is a 
common symptom after large doses 
(2, 10, 46, 48, 58). 

EEG changes have been recorded 
by several investigators (2, 9, 13, 39, 
58), but again the lower threshold is 
above the median total-body lethal 
dose. The typical pattern is similar 
to that seen in seizures, i.e., periodic
spikings, high amplitude slow waves.

The most sensitive parts of the
brain are the hypothalamus, glial
cells, brain stem including the medulla
and the cerebellum (2, 6, 9, 10,
12, 31). The cortex is more radioresistant
than these structures, and
this is of course significant in
behavioral work.

The peripheral nervous system is
even less sensitive than the c.n.s. to
radiations (32). Doses below 10,000
r. are ineffective. It takes 45,000-75,000
r. to abolish nerve conduction
in peripheral fibers (22). The autonomic
n.s. responds with a vagotonia
after an initial short duration sympathicotonia
(66). A slight decrease
in pulse amplitude has been reported
already after 750 r. Also certain parasympathomimetic
effects may be observed
during radiation sickness (56,
p. 996).

Skeletal muscles are also relatively
radioresistant. With doses below
6,000 r. no abnormality may be observed
(43). Gerstner, et al. (21) applied
50,000 r. to the rabbit gastrocnemius
and they noted that fatigue
effects could be observed only when
high performance was demanded by
using a heavy load or a high frequency
of stimulation.


BEHAVIORAL CHANGES


Almost since the discovery of X 
rays investigators have reported vari- 
ous changes in organisms following 
radiation. Lyman, et al. (46) in their
exhaustive 1933 review of neural
changes refer to a study by Tarkhanov
who in 1896 observed quieter
behavior in flies following X irradiation.
There is also an abundance of
individual clinical case studies in
which radiation was applied for therapeutic
purposes. This review will
emphasize primarily those studies,
however, which were designed specifically
to investigate behavioral effects.
The latter include those
phenomena customarily included in
the field of psychology.


Learning and Performance


The first attempts to assess the 
effects of radiations on learning were 
performed in Pavlov’s laboratory. 
Nemenow (51, 52) irradiated the 
head of one dog with a dose of 1,500 r. 
There was only a slight drop in 
his salivary CR’s. After an additional
2,200 r., however, the CR’s
practically disappeared for a period
of five weeks. A second dog received
3,500 r. then again 2,800 r. and the 
results were essentially similar to 
those seen in the first animal. Ly- 
man, et al. (46) X-irradiated the oc- 
cipital part of the head of four dogs 
with massive doses of 17,000-18,000 
r. after their CR’s had been stabil- 
ized. All of the animals showed a 
temporary decrease in their salivary 
CR’s, but the onset and duration of 
this decrement varied. Two of the 
animals (“excited types’’) actually 
showed a rise in CR’s preceding the 
drop. The strength of the responses 
also varied as a function of the type
of CS. One of the animals kept alive
for six months after the treatment
exhibited a second lowering of CR’s
following the recovery from the first
decrease. This latency in the gross
pathological manifestations is consistent
with the other investigations
discussed in the previous section.
The change in the CR’s occurred during
a period when the S exhibited
ataxia, impaired vision, circus movements,
and general deterioration of
behavior. It was difficult to test the
dog. Interpretation of the data from
the whole study is obscured by the
observation that in three Ss not only
the CR’s but the UR’s also showed a
drop.

In a study for which an abstract
only is available, Harlow (28) reports
that radon tubes inserted into
the cortex of ten rhesus monkeys
produced progressive loss on delayed
reaction, patterned string tests, and
simple position habits. No data are 
given for the dosage used. It was 
probably quite large in view of other 
negative findings reviewed below. 
No further work was done in this 
field until after World War II. 
Furchtgott (16) tested rats exposed to 200-500 r. of total X radiation in a four-unit water maze. Neither acquisition nor retention, using several criterion measures, was affected by the treatment. Arnold (3) exposed the heads only of rats to 300-800 r. and tested them for retention of a 14-unit T-maze habit; other irradiated Ss were tested for the learning of the habit. No statistically significant changes were found. Fields (14) studied performance on elevated runways, 32- and 40-choice-point elevated T-mazes, and a 10-choice-5-stage vertical maze of some 500 male rats which had received doses ranging from 100-1,000 r. On the whole radiation had little effect on the performance of the animals except for a decrease in the speed and amount of activity immediately following irradiation, which was probably due to the general radiation malaise. Davis (11) tested rhesus monkeys in the Wisconsin General Test apparatus following X irradiation but was unable to find any impairment of performance on discrimination-type tasks. In a series of studies sponsored by the U.S. Air Force School of Aviation Medicine (34, 57), monkeys were tested on acquisition, retention, and transfer of multiple discrimination problems immediately and 150 days after exposure to sublethal and lethal doses of X rays. Again the reported results failed to demonstrate any deleterious effects. The only deficit that was noted was an increase in reaction time. 

Garcia et al. (18) established a conditioned aversion to a saccharine solution which was associated with exposure to gamma irradiation. Experimental animals had saccharine solutions in their cages while being exposed for six hours in the gamma field, while control Ss had tap water. Preference was then tested for 63 postirradiation days. The control group showed no loss of their natural preference for saccharine, while experimental Ss exposed to only 30 r. showed a significant drop in their saccharine intake. The authors hypothesize a general behavior disturbance during radiation which became associated with the taste stimuli. It should be pointed out that the animals were being exposed at a very slow rate and some of the general radiation malaise might have been effective for a sufficient length of time for the conditioning. The effectiveness of the low dose used is surprising, however. 

Jones et al. (33) measured the effects of 200-1,000 r. of whole-body X irradiation on activity-wheel performance, using 194 rats. Data were analyzed separately for animals that survived the eight-week experimental period and those that succumbed to radiation injury. Rats which died during the first nine postirradiation days showed a gradual decrease in activity until death. Those that survived nine days but died subsequently showed a decrease immediately after irradiation, followed by a recovery and a second depression of activity prior to death. All of the surviving animals (doses 200-680 r.; all animals with higher doses died) showed a decrease in activity postirradiation. The 200-300 r. groups recovered completely by the fifth day. The higher-dose groups also showed a partial recovery during the first postirradiation week, but they exhibited a second depression during the third week. The 400-450 r. groups attained normal levels of activity four weeks after irradiation, the 681 r. groups after eight weeks. In general there was a direct relationship between degree and duration of activity depression. 

In another study the same group of investigators (36) tested the effects of 300-1,000 r. X irradiation on exhaustive swimming exercise. The rats were placed into a 24-gallon tank where they were forced to swim until they were exhausted and sank, remaining below the surface of the water for longer than 30 seconds, at which time they were retrieved. Length of swimming time before sinking was measured. Following radiation, performance gradually decreased, reaching a minimum level during the third to fourth postirradiation week. From then on there was a gradual return to the normal level, which was attained by the ninth week. While the depression was directly related to the dose, the 300 r. group barely differed from control animals. The 500 r. group, however, showed a significant drop and the higher r. animals in turn differed significantly from the 500 r. group. 

Furchtgott (15) subjected adolescent rats to 300 and 500 r. of X rays and tested their swimming speed in a 12 ft. straight-away tank for 13 days. The 300 r. group did not differ from the controls, but the survivors in the 500 r. group were significantly slower. 

Vogel (67) daily irradiated with 50 r. X rays six aggressive mice, each of whom, prior to the treatment, always defeated submissive animals. Even after irradiation the aggressive animals continued to be dominant until shortly before death. 

McDowell (47) observed a reduction in “other-animal involved” behavior and visual attention to the activity of other animals following 400 r. of X irradiation in 10 rhesus monkeys. The animals also showed fewer instances of aggression and a greater incidence of lethargy. All of these symptoms are easily understandable considering the general malaise which is associated with radiation. 

Leary and Ruch (38) exposed 18 rhesus monkeys to 200-400 r. of total-body X irradiation. “Cage-crossings” were not affected. On the first postirradiation day, only for the 400 r. animals (the others were not observed), scratching, grooming, and other signs of activity were depressed, a sign of general malaise. Mechanical puzzle manipulation did not produce statistically significant differences between pre- and postirradiation periods. Pedometer manipulation was impaired in the 400 r. animals (others were not tested) and, surprisingly, weight-pulling, supposedly a measure of general strength, did not decrease in all animals. 

In general it may be said that radiation produces a certain amount of depression in activity which should be most apparent when motivation is low or when the task requires a great deal of effort, as in the exhaustive swimming experiment (36). The latter effect would tend to parallel the findings of Gerstner et al. (22) on the effect of X irradiation on muscular contraction. 

There is a puzzling report of a clinical study of 120 patients who had received several single doses of 30-50 r. during a 7-10 day period, totaling 150-250 r. of diencephalic X irradiation (4). Immediately following the treatments the typical symptoms were numbness, apathy, and tingling sensations in the head region. The day after the irradiation, however, most patients reported spontaneously that they felt euphoric, active, and generally tranquil. This state lasted from a week to several months. Most of the treated patients were neuropsychiatric cases with diagnoses of urticaria, migraine, depression, etc. However, in addition, two medical collaborators subjected themselves to 100 r. administered to the diencephalon and they also experienced the same changes as the patients. Sixty-one of the patients reported changes in their sleep patterns. The sleep on the night following the treatment was usually characterized as “extremely deep,” “heavy,” or “leaden.” In addition 37.5 per cent of the Ss reported sexual changes, notably improvement in libido, potency, and the menses. 

The authors ascribe these changes to hypothalamic stimulation primarily of the anterior, parasympathetic nuclei, a finding in accord with the frequently reported radiation-induced vagotonia (66). These results, if confirmed by other investigators, should have therapeutic implications. They also raise many questions of interest to the experimentalist working with animals since we have practically no data on emotional behavior following radiation. 

Summary. The lack of any dramatic changes in learning functions following sublethal or just lethal total-body X irradiations reported by several experimenters agrees very well with similar neurophysiological observations on the resistance of the nervous system in that dose range. It takes doses which are well above the median total-body lethal range to produce any neural changes and then there is usually a considerable latency. In the one study in which there was an 18-month time lapse between the treatment and testing, no drastic decrements took place (14). Whether a longer period would have any effects is an open question. While acquisition, retention, or transfer are not affected, performance indices which utilize gross muscular activity are impaired to some extent and this impairment persists for a number of months (in the study of swimming endurance [36] up to nine months for the most heavily irradiated group). Another factor to be considered is what might be called, for the lack of a better name, general malaise, which includes a lack of motivation to respond to stimuli or initiate activity, which is present immediately following radiation and appears again during the second week in more heavily irradiated animals. This is accompanied also by a loss in appetite and drop in body weight. 


Sensory Functions 


Hearing. In the clinical literature there are reports of improved hearing following X irradiation. In the early thirties Girden (24), working in Culler’s laboratory, attempted to investigate this problem using dogs in the classical conditioning setup. Standard psychophysical procedures were employed to obtain absolute intensity thresholds. Subsequently the heads of twelve animals were irradiated. The study was exploratory in nature and there was no systematic design to test radiation factors. Eight animals were irradiated using 80-100 kv. peak voltage and 5 ma., while four animals got roentgen rays generated at 200 kv. peak and 5 ma. One animal received 5 r. every day for five months, one 5 r. for four days, one anywhere from 100-1,100 r. on seven days, spaced one to seven days apart, and so forth. The total dosage varied from 20-11,100 r. The animals which were irradiated with the 80-100 kv. rays all showed a transient gain in acuity which averaged 5.5 decibels after a latent period of seven to eleven days. Dosage was apparently not involved since the changes appeared even after the surprisingly low value of 20 r. None of the Ss irradiated at 200 kv. showed any improvement in acuity. 

In a second study Brogden and Culler (5) examined more critically the effect of dose and also the frequency variable. Ten animals were irradiated at nine different intensities ranging from 75 to 675 r. The gain in acuity was independent of the dose, varying from 3.84 db to 7.87 db. The duration of the gain was also independent of the dose and it varied from 8.0 to 10.3 days. The latency, however, was inversely related to the dose. At 75 r. it was 7.6 days and at 675 r. it was 2.6 days. 

To explore the mechanism of the phenomenon, two dogs were hypophysectomized and irradiated; auditory tests were conducted on a diabetic subject and a normal dog when blood-sugar levels were high and low (by injection of insulin); and blood-sugar levels were measured before and after irradiation in one dog. In all of these cases hypoglycemia was associated with lower auditory thresholds. The authors hypothesize that low sugar levels lessen density and viscosity of cochlear fluids, and thereby decrease resistance to incoming vibrations, and perhaps also that the ionic conditions in the cochlea affect the magnitude of the cochlear potentials. 

Vision. Fields (14) found no effects on brightness or acuity discrimination in rats following X irradiation. Russian workers (35) have reported that dermal X irradiation increases the threshold to dark adaptation and that this effect persists for several days. Lenoir (41) tested dark adaptation in 11 patients following therapeutic irradiation. In all cases there was a decrease in dark adaptation which was independent of the dose (2,400-6,240 r.). The changes could be detected for 20 to 36 days. The author ascribes this reduction in dark vision to a drop in vitamin A concentration which follows the X irradiation. Furchtgott (17) tested brightness discrimination in a Lashley jumping box under conditions of low illumination following 369-469 r. of X irradiation. The performance of the irradiated rats was slightly inferior to that of control animals. It should be noted here also that Cibis et al. (8) found that rod cells are considerably more radiosensitive than cones. Destruction of rods required 1,700-2,000 r. while the threshold for cones is 10,000-30,000 r. The work on cataract formation has been reviewed adequately (40) and it is omitted here since the studies involve primarily morphological changes. 

Other senses. The work on other senses is scant. Lindemann (44) observed fifteen patients who received therapeutic X-ray treatments for tumors in the oral cavity. Taste sensitivity and in some cases odor sensitivity were depressed for several months. In an unpublished study Furchtgott found some indication of lowered thresholds to electric shock in rats following sublethal doses of whole-body X irradiation. 

Summary. While there is some evidence for changes in sensory functions, notably hearing and scotopic vision, after irradiation, the available data are quite limited. Much more research is necessary in the various sensory modalities to determine the factors, if any, which affect perception. 


SUMMARY AND CONCLUSIONS 


The published studies pertaining to the behavioral effects of high-energy radiations were reviewed. More studies have actually been performed in this area. The author knows of several additional ones, performed by himself and by others, but the negative results have discouraged the workers from publishing them. 

Underlying any discussion of the behavioral effects of radiation is the relative radioresistance of the adult nervous system. Total-body doses in the median lethal range do not seem to produce any gross neural dysfunctions. Except for the instances in which the body is shielded and the radiations are applied to the head only, death will intervene long before any neural changes can be observed. Thus we will not find any significant behavioral changes in those activities which are mediated directly by the nervous system. We have reviewed several studies of learning by different investigators which seem to bear this out. Actually it is possible that an investigation will show a decrement in learning following radiation. However, this would be primarily a reflection of the change in the nonassociative learning factors, i.e., motivation and perception of the stimuli. 

We have pointed out that radiation produces changes in the blood and body fluids, gastrointestinal tract, and some of the endocrine secretions. Thus the homeostatic energy-controlling mechanisms are affected and we should find, therefore, changes in motivation and performance. We have indeed seen that some of these functions are altered. Changes in food and water intake, exhaustive swimming exercise, activity-wheel and pedometer performance, and social behavior have been reported. There are still a number of problem areas here, such as emotionality and motivation other than hunger and thirst, which have not been investigated. Here we should mention again that radiation seems to lead to the pituitary-adrenocortical stress reaction and that the hypothalamus and the autonomic nervous system are relatively more sensitive than the cortex. It would seem also that performance which requires a large expenditure of energy or where extrinsic incentives are very small will be affected the most by radiations. 

In the sensory field some experi- 
mental work has been reported on 
hearing and vision and we have also 
clinical data on these and other 
modalities. On the whole, however, 
there are large gaps here. 

The great sensitivity of the developing nervous system was briefly discussed. The quantity of behavioral data does not approach our knowledge of morphological changes. We have only two studies on maze learning in rats. It would seem that this area should be explored in greater detail and functions other than maze learning could be explored. 

The genetic aspects of radiation were not considered since we have no data here on variables which are conventionally classified as psychological. 


COMMENTS ON MEEHL AND ROSEN’S PAPER¹ 


SAMUEL KARSON AND SAUL B. SELLS 
USAF School of Aviation Medicine, Randolph Field, Texas 


The recent paper by Meehl and Rosen (3) presents a rationale for evaluating the predictive efficiency of psychometric instruments which should be of interest and importance in clinical and personnel research. The purpose of this comment is to emphasize the principle of the dependence of statistical criteria on administrative policy in selecting the appropriate criterion among the cases which they have effectively presented. 

The basic reference statistic for evaluation for predictive efficiency is the base rate, according to Meehl and Rosen (3, p. 194). Evaluation of any predictor requires comparison of results based on prediction with the base rates prevailing in the situation. Thus, if one thousand candidates were available for military service and the base rate of noneffectiveness were 5%, 950 successful candidates might be expected without screening. Now, if a screening device operated to admit less than 950 successful candidates in the same situation, Meehl and Rosen would consider such a test less efficient than the base rate. 


Their analysis considers three separate cases. The first is efficiency in detecting cases of poor adjustment. Here they classify as errors of prediction only the false-positives rejected. When the false-positive rate is higher than the base rate of noneffectiveness, they would conclude that use of the screening test would be less efficient than no screening at all. The second case is efficiency in prediction for all cases. Here they classify as errors of prediction both the false-positives rejected and the false-negatives accepted. When the number of successful cases attained through a sample of available individuals is lower as a result of screening than could be expected according to the prevailing base rate, they would consider such screening inefficient. The third case is called efficiency in detecting cases of good adjustment. Here only false-negatives are regarded as errors. Thus to the extent that the proportion of successfuls in the sample accepted is greater than expected according to the base rate, they would consider screening to be efficient. They point out, however, that such efficiency is relative, inasmuch as it purchases increased efficiency of personnel accepted at the cost of rejecting some potentially successful candidates in the screening process. 

Although the point is implied by Meehl and Rosen, it seems important to emphasize as a general principle that the choice of the appropriate test of efficiency depends on the policies in effect and the purposes of screening required to fulfil them. Widespread misunderstanding of this principle could seriously impair the status of many useful screening and prediction programs. All too often scientists are too preoccupied with considerations of validity, while they fail to recognize the practical problems facing administrators who utilize psychometric techniques. On the other hand, administrators need to understand this principle so that they may avoid the error of rejecting useful methods as well as that of accepting inefficient ones, through faulty evaluation. 

¹ The writers wish to express their appreciation to Dr. Samuel Fulkerson for contributing to the discussion which culminated in the present paper. 

With specific reference to induction screening of military personnel, the manpower administrator is concerned with supply and demand issues on one hand, and with the burden of additional administration and loss of productive work due to noneffectiveness on the other. In times of manpower scarcity, he may be pressed to utilize every available man. Under such circumstances, he would seek to admit the maximum number from the pool available. Then the Meehl-Rosen Case 2 would be properly applied in evaluating prospective screening devices. 

If, however, manpower shortages were less pressing, or if the waste attributable to noneffectiveness were considered too great, the administrator might be agreeable to the rejection of some potentially successful individuals by a screening device which could assure a greater proportion of successful candidates from the number admitted than might be expected according to the base rate. The gross number of successful candidates for any available sample would be less, depending upon the rejection rate for the particular screening device, but the noneffectiveness rate might be reduced. In these circumstances Case 3 would be appropriate to evaluate the increase in proportion of successful candidates as a result of screening and Case 1 could be used to evaluate the cost in terms of false-positive rate. 

The criterion implied in Case 2 requires maximization of the number of successfuls in relation to the total pool available, whereas Case 3² requires maximization of the number of successfuls in relation to the number admitted. Both need to be evaluated against the base rate. The former criterion may be dictated in circumstances of manpower scarcity, while the other would reflect a policy decision more sensitive to the cost of accepting and caring for noneffective individuals in hospitals, guardhouses, and nonproductive jobs. Policy, not mathematical reasoning, must dictate the appropriate criterion of evaluation and the proportion of incorrect predictions which can be accepted. 
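
The contrast between the two criteria can be made concrete with a small calculation. The following is only an illustrative sketch: the pool of 1,000 candidates and the 5% base rate of noneffectiveness come from the example earlier in this comment, while the screening outcome (800 admitted, 780 of them successful) and the function name `screening_summary` are hypothetical values chosen for illustration and are not taken from Meehl and Rosen.

```python
def screening_summary(pool, base_rate_noneffective, admitted, admitted_successful):
    """Compare one screening outcome with the base rate under Case 2 and Case 3.

    Case 2 relates the successful candidates obtained to the total pool available.
    Case 3 relates the successful candidates obtained to the number admitted.
    """
    base_success_rate = 1.0 - base_rate_noneffective
    return {
        # Successful candidates expected if everyone in the pool were admitted.
        "expected_successful_without_screening": pool * base_success_rate,
        "base_success_rate": base_success_rate,
        "case2_successful_per_pool": admitted_successful / pool,
        "case3_successful_per_admitted": admitted_successful / admitted,
    }


# Pool size and base rate follow the text; the screening outcome is hypothetical.
summary = screening_summary(
    pool=1000,
    base_rate_noneffective=0.05,
    admitted=800,
    admitted_successful=780,
)

# Under Case 2 this device loses to the base rate (780 successes instead of 950).
case2_beats_base = summary["case2_successful_per_pool"] > summary["base_success_rate"]
# Under Case 3 it beats the base rate (97.5% of those admitted succeed, against 95%).
case3_beats_base = summary["case3_successful_per_admitted"] > summary["base_success_rate"]

print(summary)
print("Case 2 favorable:", case2_beats_base)
print("Case 3 favorable:", case3_beats_base)
```

With these assumed figures the same device is judged inefficient under Case 2 and efficient under Case 3, which is the sense in which the choice of criterion is a matter of policy rather than of arithmetic.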

The writers feel that in view of the general excellence of Meehl and Rosen's paper, their oversight in connection with their discussion of Case 1 should be mentioned. They demonstrate (3, p. 195) that the use of the Danielson and Clark (2) screening inventory would result in a decrease in the total percentage of correct predictions made (from 95% to 79.7%) when comparing the test with the base rates. They do not, however, indicate that the screening inventory has actually succeeded in raising the percentage of correctly predicted “fails” from 5% (base rate) to 13%. Later they do recognize this kind of gain when they demonstrate (3, p. 204) that a certain cutting score on the Glueck prediction index succeeds in correctly identifying delinquents with an accuracy of 92.6%, as compared with an expected 20% base rate, even though predictions are made for only 2.4% of the population. 

² It is of interest to note that the current induction screening policy of the armed services emphasizes the second criterion described. 


There have been several recent instances in the psychological literature (1, 2, 6, 8, 9) of the use of the statistic known as Kendall's tau (τ), a nonparametric correlation coefficient. It is to be hoped that its use reflects a growing realization among psychologists of the inadequacy of the Pearson product-moment coefficient (r) in a number of circumstances. Some of these circumstances are: 

1. When the variates to be correlated show sharp departures from normality. Although the distribution of sample r’s from nonnormal but uncorrelated populations differs only slightly from the normal case (4), it may differ considerably when the true r is not zero, kurtosis rather than skewness being the more important factor (3). 

2. When the variates to be correlated are unmeasurable according to an objective scale, as in the case of ratings or preferences of judges, or when precise measurement is impractical and the raw data must be sets of ranks. Under these circumstances, the evaluation and interpretation of r often requires assumptions which it would be imprudent to make. 

3. When there is reason to believe that the regression of one variate on the other is nonlinear, r will tend to underestimate the degree of interdependence. 

¹ The preparation of this paper was supported in part by Research Grants M-658C and MH-301 from the National Institutes of Health, U. S. Public Health Service. This work was done while both authors were at the Iowa Child Welfare Research Station. 

The use of a rank-correlation coefficient requires no assumptions regarding the form of the distributions of the variates and is thus admirably suited to the resolution of the difficulties posed by the first two circumstances. A rank coefficient also will not underestimate a relationship even when regression is nonlinear so long as the regression function is monotonic, which is usually the case in psychological research. 

These considerations apply both to τ and to the better-known rank correlation, Spearman's rho. This paper, however, will be concerned entirely with the former since it has a number of advantages over rho, and is rarely discussed in current statistical texts. The most important of these advantages is that the significance of a sample tau (t) can be evaluated with certainty in terms of the normal probability integral for all but very small values of n. Furthermore, confidence limits for τ can be determined from sample t's. If the rank-order coefficient is regarded as merely a rough approximation of r, these considerations are not particularly important. When it is used as a test of an hypothesis for which it alone is appropriate, as is often the case, then these advantages, especially the former, become significant. 

Tau can also be used for the computation of both partial and multiple correlation coefficients. However, neither of these measures is very useful at present, as we shall indicate in a subsequent section. 


DEFINITION AND INTERPRETATION 


Tau is defined as 

$$t = \frac{S}{\tfrac{1}{2}\,n(n-1)} \qquad [1]$$

where n is the number of items ranked and S = (P − Q), where P is the number of item pairs on the order of which both rankings agree, and Q is the number on which they disagree. Tau will vary from +1.00 when all possible pairings are ranked concordantly, to −1.00 when all pairings are ranked discordantly. 

Consider the following rankings of the “ambiguity” of eight sentences made by two judges, with the rankings of Judge A arranged in the natural order: 

Judge A:  1  2  3  4  5  6  7  8 
Judge B:  5  1  … 

The first sentence, i.e., the one ranked 1 by Judge A, has to its right in Judge B's rankings 3 larger ranks and 4 smaller ranks. We allot +1 for each of the larger ranks, and −1 for each of the smaller ranks. The second sentence in Judge A's ranking has to its right in Judge B's ranking 6 larger ranks and no smaller ranks. And so on. Allotting in this fashion, we obtain +3, −4; +6, −0; … P, the sum of the pluses, is 17 and Q, the sum of the minuses, is 11. S, P − Q, is thus 6. With n = 8, we obtain according to [1] a t of 6/28, which is .21. 

The interpretation of t follows readily from [1] since n(n − 1)/2 is the total number of item pairs with respect to which the rankings can be compared. A given value of τ asserts that the statement “The order in which two items are ranked according to one variate (or judge) will be the order in which they are ranked by another variate (or judge)” will be correct (100 + 100τ)/2 per cent of the time, on the average. 
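
The counting procedure for untied ranks can be sketched in a few lines of Python. This is a minimal illustration, not the authors' own computation. Judge B's full ranking is not legible in the source, so the list `judge_b` below is an assumption: it is one ordering that reproduces the counts stated in the text (+3, −4 for the first sentence, +6, −0 for the second, 11 minuses in all, S = 6). The function names are likewise hypothetical.

```python
from itertools import combinations


def concordance_counts(x, y):
    """Return (P, Q): item pairs ordered alike and ordered oppositely in x and y."""
    p = q = 0
    for i, j in combinations(range(len(x)), 2):
        product = (x[j] - x[i]) * (y[j] - y[i])
        if product > 0:
            p += 1  # both rankings agree on the order of this pair
        elif product < 0:
            q += 1  # the rankings disagree on the order of this pair
    return p, q


def tau_untied(x, y):
    """Sample tau for rankings without ties: S divided by n(n - 1)/2."""
    n = len(x)
    p, q = concordance_counts(x, y)
    return (p - q) / (n * (n - 1) / 2)


judge_a = [1, 2, 3, 4, 5, 6, 7, 8]
# Assumed ranking, chosen only to match the counts given in the text.
judge_b = [5, 1, 6, 7, 2, 3, 8, 4]

P, Q = concordance_counts(judge_a, judge_b)
t = tau_untied(judge_a, judge_b)
# Percentage of item pairs expected to be ordered alike: (100 + 100 t) / 2.
percent_agreement = (100 + 100 * t) / 2

print(P, Q, P - Q)
print(round(t, 2), round(percent_agreement, 1))
```

Under the assumed ranking the sketch gives P = 17, Q = 11 and S = 6, so t is 6/28, and the two judges would be expected to order a pair of sentences alike about 61 per cent of the time.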

When there are tied ranks, certain adjustments in the computational formula for t must be made since the total number of possible item pairs will vary as a function of the ties. If there are ties in only one of the rankings, we arrange the untied ranking in the natural order and proceed to compute S as before except that the numbers in the second ranking, to the right of the item under consideration, which are the same as the rank of this item contribute nothing to the value of S. When both rankings contain ties, we arrange either one in the natural order and compute conventionally except that item pairs which are tied in the upper ranking also contribute nothing to the value of S.² 

The major adjustment for tied 
ranks occurs in the denominator of 
[1] as might be expected. The general 
formula for 7 from tied ranks contain- 
ing the adjusted denominator is: 


² Smith's (12) description of the method for calculating τ when ties are
present is in error since it neglects the effects of ties in the upper
ranking on S. This oversight leads to markedly unreasonable τ's and
distorts the sampling distribution by producing too many large absolute
values of τ. For instance, in one of Smith's examples (12, p. 570), he
obtains a corrected τ of +1.00 despite the fact that one judge perceived
differences between items which were rated identically by the other.
The correct procedure leads to a τ of .868, which expresses the high,
though not perfect, degree of agreement which is present.
where $V = \tfrac{1}{2}\sum v(v-1)$ and $U = \tfrac{1}{2}\sum u(u-1)$.
The computation of V and U will be illustrated in the following example.

In the two sets of ranks below, the upper ranking has been arranged in
the natural order.

1    2.5    2.5    4    6    6    6    8

2    6      6      6    2    6    2    6

The first item, i.e., the one ranked 1 in the upper ranking, has five
larger ranks to its right in the lower ranking, and none smaller. It is
not tied with any other item in the upper ranking, so its contribution
to S is +5. The second item has 2 smaller ranks to its right, and thus
contributes −2. Although this item is tied in the upper ranking, the
pairs which are tied are not involved in the contribution. Similarly for
the third and fourth items. The fifth item has 2 larger ranks to its
right, but one of these, the sixth item, is tied with the fifth in the
upper ranking, and thus does not contribute. The net contribution of the
fifth item is therefore only +1 instead of +2. A similar procedure for
the sixth item leaves it with a net contribution of 0. The seventh item
contributes +1. S, the net total, is 7−6=1.

V and U are obtained in the following manner: the upper ranking, from
which V is computed, contains two sets of ties, one of extent 2 and one
of extent 3. For the first set, v=2, and v(v−1)=2(2−1)=2. For the
second, v=3, and v(v−1)=3(3−1)=6. The sum of the expressions v(v−1) in
the upper ranking is (2+6)=8, and V=½(8)=4. The lower ranking also
contains two sets of ties, of extents 3 and 5. These will enter into the
computation of U. For these ties, u=3 and 5 and u(u−1)=6 and 20. Hence
U=½(6+20)=13.

Substituting the computed values of S, V and U in [2], with
½n(n−1)=½(8)(7)=28, we obtain τ = 1/√[(28−4)(28−13)] = .053.
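The arithmetic of this example can be checked mechanically. The sketch below is only an illustration: the function names are ours, and the two rankings are an assumed reconstruction chosen to agree with the item-by-item contributions, the tie extents, and the totals described in the text. Pairs tied in either ranking are scored zero, and the denominator is adjusted by V and U.

```python
from collections import Counter
from itertools import combinations
from math import sqrt


def tie_term(ranks):
    """Half the sum of t(t-1) over every set of tied ranks of extent t."""
    return sum(t * (t - 1) for t in Counter(ranks).values()) / 2


def tau_with_ties(upper, lower):
    """Return (S, V, U, tau), with the denominator adjusted for ties."""
    n = len(upper)
    s = 0
    for i, j in combinations(range(n), 2):
        # A pair tied in either ranking gives a zero product and adds nothing.
        product = (upper[j] - upper[i]) * (lower[j] - lower[i])
        if product > 0:
            s += 1
        elif product < 0:
            s -= 1
    v = tie_term(upper)
    u = tie_term(lower)
    pairs = n * (n - 1) / 2
    return s, v, u, s / sqrt((pairs - v) * (pairs - u))


# Assumed rankings, consistent with the worked description (n = 8).
upper = [1, 2.5, 2.5, 4, 6, 6, 6, 8]
lower = [2, 6, 6, 6, 2, 6, 2, 6]
s, v, u, tau = tau_with_ties(upper, lower)
print(s, v, u, round(tau, 3))
```

Scoring every pair by the sign of the product of rank differences is equivalent to the left-to-right counting procedure, since the upper ranking is already in the natural order.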

When τ is computed without adjusting the denominator for ties, it will
always be numerically less than when the adjustments are made. The use
of the uncorrected denominator is recommended by Kendall (7) when
agreement with an objective ranking is being determined. In such a case,
only the judge's ranking would contain ties, and these would
theoretically indicate inability to discriminate the objective order, a
failing for which the judge should properly be penalized. In general,
however, the corrected formula should be used since rank correlations
are usually computed when agreement rather than accuracy is the issue.

The procedure for adjusting for tied ranks can be generalized to include
cases involving dichotomies. A dichotomy may be regarded as 2 sets of
tied ranks of the extents of the number in each of the two categories,
and the computational procedure need not differ from instances in which
ties are less extensive. However, some labor can be avoided by the use
of the following formulae:

$$\tau = \frac{S}{\sqrt{\left[\tfrac{1}{2}n(n-1) - V\right]pq}} \qquad [3]$$

when one of the variates is a dichotomy consisting of p and (n−p)=q
members in the categories, or

$$\tau = \frac{S}{\sqrt{(pq)(xy)}} \qquad [4]$$

when both variates are dichotomized into categories consisting of p and
q members, and x and (n−x)=y members. In this case, if we arrange the
frequencies in a 2×2 table as for correlated proportions, S will be
found to equal the difference between the products of the frequencies in
the diagonal cells.

If the computation of τ when ties are present seems tedious, it should
be pointed out that rho has no advantage in this respect. The proper
computation of rho from tied ranks also involves corrections in both
numerator and denominator, the latter being similar in form and effort
to that required for τ. Unfortunately, most texts fail to mention, let
alone describe, the corrections for rho, thereby creating the inaccurate
belief that none are necessary.
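For the doubly dichotomized case, [4] reduces to a few multiplications. The following is a minimal sketch under our own assumptions: the function name and the cell labels a, b, c, d are hypothetical, and the frequencies are invented for illustration.

```python
from math import sqrt


def tau_two_dichotomies(a, b, c, d):
    """Tau by formula [4] for the 2x2 frequency table

                 second variate
                  low    high
    first  low     a      b
           high    c      d
    """
    p, q = a + b, c + d      # category sizes of the first variate
    x, y = a + c, b + d      # category sizes of the second variate
    if 0 in (p, q, x, y):
        raise ValueError("each variate needs members in both categories")
    s = a * d - b * c        # difference of the diagonal products
    return s / sqrt(p * q * x * y)


# Hypothetical frequencies, not taken from the paper.
print(round(tau_two_dichotomies(6, 2, 3, 9), 3))
```

Here a·d counts the pairs ordered alike on both variates and b·c the pairs ordered oppositely, so their difference is S; pairs falling in the same row or column are tied and contribute nothing.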
TESTS OF SIGNIFICANCE⁴

The distribution of sample τ's for uncorrelated variables rapidly
approaches normality and is satisfactorily approximated, when n>10, by
the normal distribution with a mean of zero and a variance defined as

$$\sigma_\tau^2 = \frac{4n+10}{9n(n-1)} \qquad [5]$$

When ties are present, the formula for the variance of τ becomes
complicated. If the number of ties is small,

⁴ In this paper all significance tests are attributable to Kendall (7)
unless there is a specific indication to the contrary.
The variance of r when there are ties in 


both ra 


1S oc 


' 


nking contains ti 
[5] may be used with only a slight error. Since the correction for ties
will invariably reduce the variance, the use of the uncorrected formula
will furnish a more conservative test of the null hypothesis.

Kendall (7) provides probability tables for evaluating the significance
of an obtained S (rather than its τ) when n≤10. Values of τ required for
significance at the .10, .05, and .01 levels (or beyond, since τ can
take only a limited number of values) for n's from 4 through 10 are
shown in Table 1.

When ties are present in one of the rankings, Sillitto's tables (11) of
the distribution of S for all possible numbers of pair and triplet ties
for small n's may be used. When other types of ties are present, or when
both rankings contain ties, the evaluation of τ is not feasible if n is
10 or less.


CORRECTION FOR CONTINUITY 


When the significance of τ is evaluated using normal probability tables,
it must be corrected for continuity, since S cannot assume all values
within the range ±½n(n−1). Since n is fixed, an increase in P is
accompanied by a decrease in Q, and the


When one ranking is a dichotomy consisting of x and y members so that
(x+y)=n, the variance is

$$\sigma_\tau^2 = \frac{4xy}{3n^3(n-1)^3}\left\{n^3 - n - \sum (u^3 - u)\right\}.$$

The variance when one ranking is a dichotomy and the other contains no
ties is

$$\sigma_\tau^2 = \frac{4xy(n+1)}{3n^2(n-1)^2}.$$

When both rankings are dichotomies with x and y, and p and q members
respectively, the variance becomes

$$\sigma_\tau^2 = \frac{4xypq}{n^2(n-1)^3}.$$

The above formulae are to be found in Kendall (7).
minimum change in S is thus 2. The appropriate correction for continuity
is therefore to subtract 1 from the absolute value of S. This is
equivalent to a deduction of 2/n(n−1) from the absolute value of τ, and
the correction may be applied at either point.

This simple correction is appropriate when neither distribution contains
ties, or when only one has ties. When one ranking consists entirely of
ties of extent u, and the other ranking is a dichotomy, the correction
for continuity consists of subtracting u from S, or 2u/n(n−1) from τ. If
both variates are dichotomies, the deduction for continuity is ½n from
S, or 1/(n−1) from τ.

In instances where both rankings contain ties but are not dichotomies,
there is no simple way of applying a correction. Whitfield's proposed
correction (13) for the case in which one variate is a dichotomy and the
other contains ties of varying extents might be used for the general
case of ties in both rankings. Whitfield's method involves arranging the
undichotomized ranking in the natural order and subtracting the extent
of the ties involving the smallest and the greatest rank from twice the
number of items ranked. This quantity is then divided by the number of
intervals in the ranking. One-half of this quotient is the deduction
from S for the correction. If τ is corrected instead of S, the deduction
is the quotient divided by n(n−1). The formal expression for this
correction for S is

$$\frac{2n - v_1 - v_2}{2n_i} \qquad [6]$$

where n is the number of items ranked, v₁ is the extent of the tie
involving the smallest rank, v₂ is the extent of the tie involving the
largest rank, and nᵢ is the number of intervals in the ranking. (If a
ranking had no ties, nᵢ=(n−1); in a dichotomy, nᵢ=1.) In our
illustrative problem (p. 340), n=8, v₁=1, v₂=1, and nᵢ=4. Accordingly,
the deduction from S would be

$$\frac{(2 \times 8) - 1 - 1}{2 \times 4} = 1.75.$$
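Whitfield's deduction depends only on the tie structure of the ranking that is not dichotomized, so it can be read directly from that ranking. A short sketch follows; the function name is ours, and the first ranking is an assumed reconstruction of the upper ranking of the illustrative problem.

```python
def whitfield_deduction(ranking):
    """Deduction from |S| by [6]: (2n - v1 - v2) / (2 * n_i)."""
    ordered = sorted(ranking)
    n = len(ordered)
    v1 = ordered.count(ordered[0])     # extent of the tie at the smallest rank
    v2 = ordered.count(ordered[-1])    # extent of the tie at the largest rank
    n_i = len(set(ordered)) - 1        # intervals between distinct ranks
    if n_i == 0:
        raise ValueError("a ranking in which all items are tied has no intervals")
    return (2 * n - v1 - v2) / (2 * n_i)


# Assumed upper ranking of the illustrative problem (n = 8).
print(whitfield_deduction([1, 2.5, 2.5, 4, 6, 6, 6, 8]))
# An untied ranking recovers the simple deduction of 1.
print(whitfield_deduction(list(range(1, 11))))
```

A ranking made up entirely of ties of extent u gives a deduction of u, in agreement with the special case stated above.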


The generalization of Whitfield's procedure to the general case of ties
in both rankings is apparently not a simple matter, and it has not yet
been accomplished. A suggestion would be to consider the ranking with
the fewer intervals (and the most tied items) as a dichotomy, and to
apply Whitfield's correction. This actually will provide an
overcorrection for continuity and hence a safer test of the null.


CONFIDENCE LIMITS OF T

It is often desirable to establish confidence limits for the parameter
correlation when a significant sample coefficient has been obtained. For
any value of a population T, the sampling distribution of τ tends
rapidly toward normality (though not so rapidly as in the null case),
provided that the absolute value of T is not too close to unity. The
mean of the distribution is the population T, but the variance cannot be
exactly determined unless something is known about the arrangement of
ranks in the population, information which is almost always lacking.
However, it can be shown that for any parameter T the variance of τ
cannot exceed the value

$$\text{maximum } \sigma_\tau^2 = \frac{2(1 - T^2)}{n} \qquad [7]$$

Confidence limits of T can be set by substituting the value of the
sample τ in [7]. An alternate method is to solve equation [8] with the
roots providing the limits. The value of x is the normal deviate
corresponding to the desired probability level.

$$T^2\left(1 + \frac{2x^2}{n}\right) - 2\tau T + \left(\tau^2 - \frac{2x^2}{n}\right) = 0 \qquad [8]$$

Since the limits determined by means of [7] and [8] are based on a
maximum variance, the probability is at least, but not precisely, (1−P)
that the true T lies within those limits. Unless n is fairly large, the
magnitude of the limits will often be so great as to render them
practically useless. Kendall (7) has developed an additional method
which involves the estimation of a parameter representing the
arrangement of ranks in the population from the obtained data. While
this method frequently results in tremendous reductions in the extent of
the confidence limits, it is too complicated and laborious for ordinary
use.


SIGNIFICANCE OF A DIFFERENCE
BETWEEN τ's

Evaluating the significance of a difference between two independent τ's
presents no special problems since such differences will be
approximately normally distributed around a mean of zero in a test of
the null hypothesis. The critical ratio which is conventionally used in
such situations is applicable to τ. The standard error of the difference
is, as usual, $\sqrt{\sigma_{\tau_1}^2 + \sigma_{\tau_2}^2}$, where
$\sigma_\tau^2$ is computed by [7].

If we wish to avoid using the sample τ as an estimate of T in computing
the variances, we have recourse to a transformation called w, which is
defined as sin⁻¹τ, in radians. Kendall (7) has shown that the sampling
variance of w can be maximized at 2/n, a value independent of the
parameter w. The standard error of the difference between w₁ and w₂ can
be maximized at

$$\sqrt{2\left(\frac{1}{n_1} + \frac{1}{n_2}\right)},$$

an expression which does not require an estimation of population w's
from the data.
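The w procedure amounts to one inverse sine per coefficient and a single square root. The sketch below is illustrative only; the function name is ours and the two coefficients and sample sizes are hypothetical.

```python
from math import asin, sqrt


def critical_ratio_w(tau1, n1, tau2, n2):
    """Critical ratio for the difference of two independent taus.

    Each tau is transformed to w = arcsin(tau) in radians, and the
    difference is divided by its maximum standard error,
    sqrt(2 * (1/n1 + 1/n2)).
    """
    w1, w2 = asin(tau1), asin(tau2)
    max_se = sqrt(2 * (1 / n1 + 1 / n2))
    return (w1 - w2) / max_se


# Hypothetical coefficients and sample sizes, not taken from the paper.
print(round(critical_ratio_w(0.60, 40, 0.20, 50), 2))
```

Because the standard error used is a maximum, the resulting critical ratio errs on the conservative side.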

The w transformation may also be used to set confidence limits for a T,
though there is no reason to feel that this would be a desirable
practice. Limits set in this manner, while differing slightly from those
determined by [8], cannot be said to be more accurate, since it is not
known whether the distribution of w is nearer normality than that of τ.
Furthermore, the computations involved in converting from τ to w and
back again may very well exceed those required in solving [8] to obtain
the limits.


A COMPLETE COMPUTATIONAL EXAMPLE

Consider the following set of rankings where the first has been arranged
in the natural order:

1    2    3    4    5    6    7    8    9    10

6    8    10   9    7    5    2    4    1    3

Computing S, we obtain +4, −5; +2, −6; 0, −7; 0, −6; 0, −5; 0, −4;
+2, −1; 0, −2; and +1, 0. The total for P is 9, the total for Q is 36,
and S = 9 − 36 = −27. For the denominator, ½(10)(9)=45. According to
[1], τ = −27/45 = −.60. Entering Table 1 with an n of 10, we find that a
τ of .60 is significant beyond the .05 level. The precise p value is
.0166.
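The counting of P and Q for untied rankings can be sketched as follows. The function name is ours, and the second ranking is an assumed reconstruction that agrees with the item-by-item contributions and the totals given for this example.

```python
def count_p_q(lower):
    """Return (P, Q, S) for an untied ranking paired with the natural order.

    For each item, P gains one for every larger rank to its right and
    Q gains one for every smaller rank to its right; S = P - Q.
    """
    p = q = 0
    n = len(lower)
    for i in range(n):
        for j in range(i + 1, n):
            if lower[j] > lower[i]:
                p += 1
            elif lower[j] < lower[i]:
                q += 1
    return p, q, p - q


# Assumed second ranking of the example (n = 10).
lower = [6, 8, 10, 9, 7, 5, 2, 4, 1, 3]
p, q, s = count_p_q(lower)
n = len(lower)
tau = s / (n * (n - 1) / 2)    # formula [1]
print(p, q, s, tau)
```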

If we wish to use the normal approximation, we require the standard
error of τ, and we must correct τ for continuity. From [5], we compute
the variance of τ as .0617, and the standard error, .248. Applying the
continuity correction at S, we recompute τ from [1] thus:
(−27+1)/45 = −.578. Or, correcting τ itself: −.60 + 2/(10)(9) = −.578.
(Since the correction is subtracted from the absolute value of S or τ,
we add it to a negative statistic.) The critical ratio of τ is thus
−.578/.248 = −2.33, which corresponds to a probability of .0198.
Comparing this value with the probability obtained from Table 1, we see
that the normal approximation is slightly in error when n is as small as
10, though it provides a somewhat more stringent null test.
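The normal approximation for this example can be sketched as below. The function names are ours; the variance is the null variance (4n+10)/9n(n−1) for untied rankings, the continuity correction subtracts 1 from the absolute value of S, and the two-tailed probability is obtained from the complementary error function rather than from printed tables.

```python
from math import copysign, erfc, sqrt


def null_variance(n):
    """Variance of tau for uncorrelated, untied rankings."""
    return (4 * n + 10) / (9 * n * (n - 1))


def corrected_critical_ratio(s, n):
    """Critical ratio of tau after deducting 1 from the absolute value of S."""
    s_corrected = copysign(max(abs(s) - 1, 0), s)
    tau_corrected = s_corrected / (n * (n - 1) / 2)
    return tau_corrected / sqrt(null_variance(n))


z = corrected_critical_ratio(-27, 10)
p_two_tailed = erfc(abs(z) / sqrt(2))   # area in both tails of the normal curve
print(round(null_variance(10), 4), round(z, 2), round(p_two_tailed, 3))
```

Carrying full precision rather than the rounded values −.578 and .248 changes the probability only in the fourth decimal place.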

To set the confidence limits of T at the .05 level and beyond, we solve
[8] with τ = −.60 and x = 1.96. The roots of the quadratic are −.93 and
+.25, which are the limits of T. The finding is hardly illuminating,
though not unexpected. Any correlation based on only 10 instances is
bound to be an uncertain estimate of the population value. If we had
used [7] to compute the limits, we would obtain a maximum standard error
of .358 and limits of −.60 ± .70 at the .05 level or beyond.
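Both ways of setting the limits can be sketched in a few lines. This is an illustration under stated assumptions: the function names are ours, the maximum variance is taken as 2(1−T²)/n, and the quadratic is the one obtained by requiring the squared distance between τ and T to equal x² times that maximum variance.

```python
from math import sqrt


def limits_by_max_se(tau, n, x):
    """Limits from the sample tau and the maximum standard error."""
    se = sqrt(2 * (1 - tau ** 2) / n)
    return tau - x * se, tau + x * se


def limits_by_quadratic(tau, n, x):
    """Roots of T^2 (1 + 2x^2/n) - 2 tau T + (tau^2 - 2x^2/n) = 0."""
    k = 2 * x ** 2 / n
    a, b, c = 1 + k, -2 * tau, tau ** 2 - k
    root = sqrt(b * b - 4 * a * c)
    return (-b - root) / (2 * a), (-b + root) / (2 * a)


low, high = limits_by_quadratic(-0.60, 10, 1.96)
print(round(low, 2), round(high, 2))
print([round(v, 2) for v in limits_by_max_se(-0.60, 10, 1.96)])
```

The first method can return a limit beyond −1 or +1, as it does here; the roots of the quadratic always remain within the admissible range.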


PARTIAL RANK CORRELATION

A procedure for computing a partial τ when there are more than two
rankings is described by Kendall (7). Suppose that we wish to determine
the relationship between the rankings of Judges A and B with the ranking
of Judge C held constant. Arrange the ranking of Judge C in the natural
order, with those of Judges A and B beneath. There are n(n−1)/2 couplets
in each ranking, i.e., items 1 and 2, 1 and 3, ..., 1 and n, 2 and 3,
etc. In Judge C's ranking, the order of magnitude of each couplet is the
same; the one to the right is the larger. We determine (a) the number of
couplets on which both Judge A and Judge B agreed with Judge C as to
order, (b) the number of couplets on which both disagreed with Judge C,
(c) the number on which A agreed and B disagreed, and (d) the number on
which B agreed and A disagreed.

MAURICE S. SCHAEFFER AND EUGENE E. LEVITT

These frequencies are now arranged in an ordinary 2×2 contingency table
and the partial τ of the rankings of Judges A and B independent of that
of Judge C is defined as

$$\tau_{AB\cdot C} = \frac{ab - cd}{\sqrt{(a+c)(a+d)(b+c)(b+d)}} \qquad [9]$$

It so happens that

$$\tau_{AB\cdot C} = \frac{\tau_{AB} - \tau_{AC}\,\tau_{BC}}{\sqrt{1 - \tau_{AC}^2}\,\sqrt{1 - \tau_{BC}^2}} \qquad [10]$$

an expression which is analogous to that for the product-moment partial
correlation coefficient. It happens further that
$\tau_{AB\cdot C} = \sqrt{\chi^2/N}$, with χ² computed from the 2×2
table and N its total frequency, which illustrates the relationship
between partial τ and the phi coefficient. Examples of the computation
of partial τ using [9] can be found in Kendall (7) and Smith (12). The
latter's example, though correct in form, contains arithmetic errors so
that the computed partial is inaccurate. For


TABLE 1

VALUES OF τ REQUIRED FOR SIGNIFICANCE AT THE .10, .05 AND .01 LEVELS AND BEYOND*

Level    n=5     n=6     n=7     n=8
.10      0.80    0.73    0.62    0.57
.05      1.00    0.87    0.71    0.64
.01              1.00    0.90    0.79

* Based on Kendall's (7) tables
most purposes [10] will be the more useful computational method.
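The agreement of the two routes to the partial coefficient, for untied rankings, can be seen in a small sketch. The function names and the three rankings below are hypothetical; one function classifies the couplets into the frequencies a, b, c and d, and the other works from the three ordinary τ's.

```python
from itertools import combinations
from math import sqrt


def tau(r1, r2):
    """Ordinary tau for two untied rankings."""
    n = len(r1)
    s = sum(1 if (r1[j] - r1[i]) * (r2[j] - r2[i]) > 0 else -1
            for i, j in combinations(range(n), 2))
    return s / (n * (n - 1) / 2)


def partial_tau_counts(rank_a, rank_b, rank_c):
    """Partial tau from the 2x2 table of agreements with Judge C."""
    a = b = c = d = 0
    for i, j in combinations(range(len(rank_c)), 2):
        a_agrees = (rank_a[j] - rank_a[i]) * (rank_c[j] - rank_c[i]) > 0
        b_agrees = (rank_b[j] - rank_b[i]) * (rank_c[j] - rank_c[i]) > 0
        if a_agrees and b_agrees:
            a += 1          # both agree with C
        elif not a_agrees and not b_agrees:
            b += 1          # both disagree with C
        elif a_agrees:
            c += 1          # A agrees, B disagrees
        else:
            d += 1          # B agrees, A disagrees
    return (a * b - c * d) / sqrt((a + c) * (a + d) * (b + c) * (b + d))


def partial_tau_formula(t_ab, t_ac, t_bc):
    """Partial tau from the three ordinary taus."""
    return (t_ab - t_ac * t_bc) / sqrt((1 - t_ac ** 2) * (1 - t_bc ** 2))


# Hypothetical untied rankings of six items by three judges.
judge_c = [1, 2, 3, 4, 5, 6]
judge_a = [2, 1, 4, 3, 6, 5]
judge_b = [1, 3, 2, 5, 4, 6]
by_counts = partial_tau_counts(judge_a, judge_b, judge_c)
by_formula = partial_tau_formula(tau(judge_a, judge_b),
                                 tau(judge_a, judge_c),
                                 tau(judge_b, judge_c))
print(round(by_counts, 4), round(by_formula, 4))
```

Both functions fail with a division by zero when either judge agrees perfectly with Judge C, a case in which the partial coefficient is undefined.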

The use of partial τ when ties are present is questionable since [9] and
[10] will give different results in such instances. This drawback, added
to the fact that generally applicable tests of the significance of any
partial τ are not yet available, limits the value of the statistic.⁶

An expression for a multiple τ has been developed by Moran (10), but the
problems of the sampling distribution of multiple τ, although apparently
simpler than those of partial τ, have also not yet been solved. The
usefulness of multiple τ, like that of partial τ, is limited at the
present time.


THE RELATIONSHIP
BETWEEN τ AND r

When ranked data can be assumed to be based on continuous, normal
distributions and n is fairly large, an estimate of the parameter
product-moment coefficient can be obtained by means of a transformation
of τ. The formula for this transformation is

$$r = \sin\left(\frac{\pi\tau}{2}\right)\ \text{(radians)} = \sin(90\tau)\ \text{(degrees)}. \qquad [11]$$

The significance of the estimated r can be tested by simply testing the
τ from which it was derived, using normal tables and a variance computed
by [5].

⁶ Hoeffding (5) shows that when neither T_AC nor T_BC is unity, the
distribution of √n(τ_AB·C − T_AB·C) is approximately normal for large
n's with a mean of zero and a variance given by an expression which he
derives. Furthermore, when T_AC and T_BC are zero, the distribution of
√n(τ_AB·C − T_AB·C) is the same, in the limit, as that of
√n(τ_AB − T_AB).

In the nonnull case, the distribution of sample τ's will be
approximately normal for large n's, with a mean of T and a maximum
variance of


5(1—r?) 


maximum o,?= 
1—T) 


Confidence limits for T can be obtained using this variance, and
corresponding limits for the transformed r are computed by translating
the limits of T into those for r using [11].⁷

A comparison of the upper limit of the variance of τ by [12] when
normality is assumed with its upper limit by [7] when no assumptions are
made will show that the assumption of normality decreases the standard
error of τ by approximately 50 per cent in the nonnull case. On the
other hand, if τ is used to estimate r when the latter could be computed
directly from the data, there will be a considerable loss of sensitivity
since the standard error of the former is always greater than that of
the latter. The ratio of the standard errors will vary from 1.2 when the
variates are uncorrelated up to approximately 1.9 when the true r is
.90.

The conversion formula for r from τ is justified only by the assumption
of normality of distribution of the variates, and when n is fairly
large. Otherwise, it would seem advisable to avoid estimating r from
ranked data, and to limit the conclusions to statements concerning τ.
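The conversion of τ to an estimate of r is a single sine. The sketch below, with function names of our own choosing, gives the radian and degree forms side by side; it presupposes the normality of the variates discussed above.

```python
from math import pi, radians, sin


def r_from_tau(tau):
    """Estimate of r as sin(pi * tau / 2), the angle being in radians."""
    return sin(pi * tau / 2)


def r_from_tau_degrees(tau):
    """The same estimate written as the sine of 90 * tau degrees."""
    return sin(radians(90 * tau))


print(round(r_from_tau(-0.60), 3), round(r_from_tau_degrees(-0.60), 3))
```

Confidence limits for r follow by passing each limit of T through the same function, since the sine is increasing over the range from −1 to +1 of its argument here.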


⁷ A standard error for r computed from τ can be derived using the
conversion formula (7). Its upper limit is


Ty) 


4 


The procedure for setting limits for r by converting limiting τ's into
limiting r's is, however, preferable because of the greater symmetry of
the distribution of τ.


n—1 